the global built world

AI challenge methodology

Rationale:

• Domain Integration: By focusing on workflows commonly used within and across multiple real estate asset classes the competition ensures that participants must understand the nuances of commercial real estate.

• Technical Acumen: Crafting eﬀective prompts requires a deep understanding of LLM capabilities and limitations, encouraging creative approaches and iterative refinement.

• Industry Innovation: Bridging the gap between domain expertise and AI enables new applications, streamlining processes impacting the work of millions of people around the world.

Competition Structure and Phases

A. Asynchronous Competition

A six week period will be allowed to develop and submit entries

• Task Distribution: Participants will be given a set of commercial real estate workflows, with each set relevant to a particular asset class.

• For instance, an ‘Retail Leasing Workflow’ might include:

◦ Goal: Find the best tenants for a retail space.

◦ Tasks:

1. Identify ideal tenant profiles.

2. Analyse foot traﬃc data.

3. Compare market rents.

4. Generate outreach emails.

5. Summarise findings for stakeholders.

• Deliverables:

◦ A set of prompts for each sub-task, with annotations explaining the rationale behind each prompt’s design. The workflow might require just one prompt, or a series of iterative prompts. The key is which will be most eﬀective and eﬃcient.

◦ A brief outline of how the prompt(s) address both domain specifics and technical requirements.

• Evaluation: Judges (comprising commercial real estate experts and prompt engineering specialists) review submissions against a detailed rubric (see Section 4).

B. Live Competition

• Live Challenge: In a timed, live session, finalists craft prompts for an unannounced workflow scenario. This tests their on-the-spot thinking and ability to apply their knowledge under pressure.

• Team vs. Individual Options: Entrants can be individuals, or teams of up to four people.

• The ‘Wildcard’ Challenge: Individuals or Teams can come up with their own workflow they’d like to address.

For this challenge we are looking for novelty, imagination and inspiration.

3. Task Design and Example: Leasing Workflow

Using the ‘Retail Leasing Workflow’ example as a template, each task can be designed to test different aspects of the prompt engineer’s skills:

Task 1: Identify Ideal Tenant Profiles

• Domain Knowledge: Requires participants to incorporate factors such as customer demographics, brand alignment, and local market trends.

• Prompt Elements: Asks for a prompt that instructs the LLM to consider specific criteria (e.g. footfall patterns, competitor presence, and complementary retail categories).

Task 2: Analyse Foot Traﬃc Data

• Technical Focus: Requires participants to design a prompt that not only asks for data analysis but also handles structured data inputs (CSV, JSON) and visualisation outputs.

• Innovation: Encourages the inclusion of statistical analysis, such as peak hours identification and segmentation of traffic by demographics.

Task 3: Compare Market Rents

• Analytical Depth: The prompt would need to guide the LLM to extract and compare data from multiple sources, highlighting regional variations and temporal trends.

• Data Integration: Would test the participant’s ability to blend qualitative insights with quantitative comparisons.

Task 4: Generate Outreach Emails

• Creative Communication: Participants would need to craft a prompt that generates persuasive, tailored outreach emails that align with the identified tenant profiles.

• Language and Tone: Participants would need to evaluate how well the prompt directs the LLM to adopt a professional yet engaging tone, suitable for the commercial real estate industry.

Task 5: Summarise Findings for Stakeholders

• Conciseness and Clarity: The prompt would need to instruct the LLM to compile a concise report summarising key insights, actionable recommendations, and data visualisations.

• Stakeholder Communication: The prompt would need to place emphasis on producing a narrative that resonates with non-technical stakeholders while retaining technical accuracy.

4. Evaluation Criteria and Scoring Rubric

A detailed, multi-dimensional rubric, with weighted scores across several categories:

1. Domain Knowledge (30%):

◦ Accuracy and depth in commercial real estate context.

◦ Relevance of criteria and data considerations.

2. Prompt Engineering Skill (30%):

◦ Clarity, precision, and structure of the prompts.

◦ Technical eﬃcacy in instructing the LLM (e.g. handling of data formats, chain-of-thought strategies).

3. Creativity and Innovation (20%):

◦ Novel approaches, such as chaining prompts or integrating external data sources (in theory via Master

Prompts)

◦ The ability to design prompts that go beyond the obvious to solve problems in unique ways.

4. Performance and Outcome (20%):

◦ Measurable success based on the quality of the LLM’s outputs when using the prompts. (If requiring external data, participants can provide sample, synthetic data sets)

◦ Efficiency, reproducibility, and clarity in final responses.

Additional Considerations:

• Documentation and Rationale: Clear explanations of choices and expected outcomes.

• Adaptability: Ability to handle variations in data inputs or scenario changes.

the global built world

AI challenge methodology

challenge@builtworldchallenge.ai