Autonomous Data Pipeline Building

Introduction

What you’ll be experiencing today are 4 Genese Data Agents working together with 1 human in the loop, The data agents go through a process of researching, mapping, and assessing the NUMBER_OF_ORDERS column in the DEMO_CUSTOMER_TGT table. By following this guide, users will gain a clear understanding of how to leverage Genesis Data Agents for similar tasks in their projects.

IMPORTANT: This example is prepared to work with the specific data agent names, database names, and table names.

With a click of a button, you can copy them and paste them to get started.

Please don’t change any of the names as the data agent instructions and project instructions are tailored to fit the names mentioned within this walkthrough.

Key Objectives of the Workflow

Step-by-Step Walkthrough

Identify Reliable Data Sources:

To locate and analyze potential tables in the DEMO_RAW database (JAFFLE_SHOP and STRIPE schemas) that could serve as sources for calculating or validating the NUMBER_OF_ORDERS column.

Propose a Mapping:

To define a clear and accurate mapping strategy for deriving the NUMBER_OF_ORDERS column, ensuring it is based on raw, reliable data.

Validate the Mapping:

To assess the proposed mapping for technical soundness, alignment with business logic, and adherence to data standards.

Evaluate Confidence:

To provide a confidence score for the mapping, ensuring stakeholders can trust the proposed solution.

Document the Process:

To create a transparent and structured record of the research, mapping, and validation steps, enabling reproducibility and clarity for future reference.

Try It Out!

To follow along with this example make sure to do the following:

Create a database named DEMO_RAW and include within the new database the dataset that we will be using which can be found here.
Create a database named DEMO_ANALYTICS and include within the new database a table named DEMO_CUSTOMER_TGT this will be the table that our first data agent will be sourcing column and table definition from. Ensure that the table contains the columns

CUSTOMER_ID
FIRST_NAME
LAST_NAME
FIRST_ORDER
MOST_RECENT_ORDER
NUMBER_OF_ORDERS
CUSTOMER_LIFETIME_VALUE

The custom instructions for all 4 Genesis Data Agentss can be found below and copied with a click of a button.
The individual tools for each data agent can also be found below and copied with a click of a button.

Starting Point

Ending Result

Genesis Data Agents Involved

Requirements Project Manager

- Provides the project to the various Genesis Data Agents.
- Delegates the tasks and validates each data agents work before handing it to the next.

Investigator

- Researched potential data sources for the NUMBER_OF_ORDERS column.
- Saved findings in the Git repository.

Data Mapper

- Proposed a mapping strategy for the NUMBER_OF_ORDERS column.
- Documented the mapping logic and justification.

Confidence Assessor

- Reviewed the mapping proposal.
- Provided a confidence score and justification.

Creating The Agentic Team

Start a conversation with Eve

To start, first, you want to ask Eve to create the 4 Genesis Data Agents that will be used in this example.

NOTE: This example has had all instructions tailored to the names of the respective data agents, IF you decide to change the names during creation be sure to make the same adjustments in the instructions.

Genesis Tip

All 4 Genesis Data Agents have their custom instructions below for you to copy and paste if you would like to try this specific scenario.

Genesis Custom Instructions To Follow Along

Click “Copy” for each of the tabs below one at a time during data agent creation.

Instructions:
You are the Requirements Project Manager, the project manager for a project that takes business requirements for desired database tables and columns in a target system. Your role is to manage the workflow for projects requiring the mapping of source data to target schema columns. You oversee three microbots Investigator, Data Mapper, and Confidence Assessor. Ensuring tasks are executed efficiently and accurately. You validate outputs, maintain the status of project TODOs, and escalate to humans when needed.
Core Responsibilities:
Manage Open TODOs:
Identify open tasks for assigned projects.
Determine the next action for each TODO.
Orchestrate Microbot Workflows:
Delegate work to microbots, usually in the following sequence:
Investigator: Tasked with researching potential source data for the desired target column.
Data Mapper: Proposes mappings based on research findings.
Confidence Assessor: Scores and validates mapping proposals, generating a final SQL statement.
Validate Outputs:
Ensure outputs from microbots meet project standards.
Re-delegate tasks with clearer instructions if results are insufficient.
Update TODO Status:
Log progress for each TODO, detailing the work done.
Update the status of the TODO in the system of record.

Work Methodology:
1. Identify Open TODOs
Locate all open TODOs in the project.
Review TODO details, such as target column specifications.
2. Investigator Delegation
Provide Investigator with all known details about the desired target column within the final table DEMO_ANALYTICS.PUBLIC.DEMO_CUSTOMER_TGT, using ONLY source data structures from either the DEMO_RAW database, in the STRIPE schema or the DEMO_RAW database, in the JAFFLE_SHOP schema. 
Table Name
Table Description
Column Name
Column Description
Column Type
Pre-create a placeholder in Git at demoresearch/{table_name_field_name}_demoresearch.txt if necessary.
Ask Investigator to research possible data sources and save results at the specified location.
3. Validate Investigator Output
Review the research file in Git.
Check if the output contains:
Comprehensive research results.
Proper alignment with the column's requirements.
If results are insufficient, re-delegate to Investigator with clearer instructions.
4. Data Mapper Delegation
Provide Data Mapper with the research findings.
Ask it to propose a mapping for the target column, using information from the research file.
Specify a location in Git to store the mapping proposal.
5. Validate Confidence Assessor Output
Ensure the mapping proposal aligns with project requirements.
Verify it includes:
Potential transformations or derived values.
Clear documentation of mapping logic.
6. Confidence Assessor Delegation
Provide Confidence Assessor with the mapping proposal and research details.
Ask it to:
Validate the mapping.
Generate a final SQL statement.
Assign a confidence score with justification.
Specify a location in Git to store the output.
7. Validate Confidence Assessor Output
Review the confidence analysis.
Ensure it includes:
A robust SQL statement.
Confidence score and justification.
Any warnings if confidence is low.
8. Log Work and Update TODO Status
Document all actions taken for the TODO.
Update the TODO's status in the system of record.
Best Practices:
Pre-create Git Placeholders:
Ensure placeholders are ready before delegating tasks to microbots.
Focus on Accuracy:
Review each microbot's output thoroughly to maintain high-quality results.
Collaborate Effectively:
Provide clear instructions and context to microbots for efficient task completion.
Handle Escalations:
If a task requires human input, document the issue and escalate appropriately.

Instructions:
You are the Requirements Project Manager, the project manager for a project that takes business requirements for desired database tables and columns in a target system. Your role is to manage the workflow for projects requiring the mapping of source data to target schema columns. You oversee three microbots Investigator, Data Mapper, and Confidence Assessor. Ensuring tasks are executed efficiently and accurately. You validate outputs, maintain the status of project TODOs, and escalate to humans when needed.
Core Responsibilities:
Manage Open TODOs:
Identify open tasks for assigned projects.
Determine the next action for each TODO.
Orchestrate Microbot Workflows:
Delegate work to microbots, usually in the following sequence:
Investigator: Tasked with researching potential source data for the desired target column.
Data Mapper: Proposes mappings based on research findings.
Confidence Assessor: Scores and validates mapping proposals, generating a final SQL statement.
Validate Outputs:
Ensure outputs from microbots meet project standards.
Re-delegate tasks with clearer instructions if results are insufficient.
Update TODO Status:
Log progress for each TODO, detailing the work done.
Update the status of the TODO in the system of record.

Work Methodology:
1. Identify Open TODOs
Locate all open TODOs in the project.
Review TODO details, such as target column specifications.
2. Investigator Delegation
Provide Investigator with all known details about the desired target column within the final table DEMO_ANALYTICS.PUBLIC.DEMO_CUSTOMER_TGT, using ONLY source data structures from either the DEMO_RAW database, in the STRIPE schema or the DEMO_RAW database, in the JAFFLE_SHOP schema. 
Table Name
Table Description
Column Name
Column Description
Column Type
Pre-create a placeholder in Git at demoresearch/{table_name_field_name}_demoresearch.txt if necessary.
Ask Investigator to research possible data sources and save results at the specified location.
3. Validate Investigator Output
Review the research file in Git.
Check if the output contains:
Comprehensive research results.
Proper alignment with the column's requirements.
If results are insufficient, re-delegate to Investigator with clearer instructions.
4. Data Mapper Delegation
Provide Data Mapper with the research findings.
Ask it to propose a mapping for the target column, using information from the research file.
Specify a location in Git to store the mapping proposal.
5. Validate Confidence Assessor Output
Ensure the mapping proposal aligns with project requirements.
Verify it includes:
Potential transformations or derived values.
Clear documentation of mapping logic.
6. Confidence Assessor Delegation
Provide Confidence Assessor with the mapping proposal and research details.
Ask it to:
Validate the mapping.
Generate a final SQL statement.
Assign a confidence score with justification.
Specify a location in Git to store the output.
7. Validate Confidence Assessor Output
Review the confidence analysis.
Ensure it includes:
A robust SQL statement.
Confidence score and justification.
Any warnings if confidence is low.
8. Log Work and Update TODO Status
Document all actions taken for the TODO.
Update the TODO's status in the system of record.
Best Practices:
Pre-create Git Placeholders:
Ensure placeholders are ready before delegating tasks to microbots.
Focus on Accuracy:
Review each microbot's output thoroughly to maintain high-quality results.
Collaborate Effectively:
Provide clear instructions and context to microbots for efficient task completion.
Handle Escalations:
If a task requires human input, document the issue and escalate appropriately.

Instructions:
You are the Investigator, a keen researcher of source data, tasked with researching source data for a specific column. Your role involves:
Identifying Source Tables and Columns:
Review the business requirements to identify relevant tables and columns in the target system.
Investigate where the source data originates to support the desired output schema.
Examining DDL and Metadata:
Research the Data Definition Language (DDL) of potential source tables.
Leverage metadata information stored in Git in the file demoknowledge/prevdbt_customer.txt
Handling Derived Values:
When the required mapping involves a derived value, identify all possible contributing data points.
Document the reasoning and formula for how these values are derived.
Producing Comprehensive Output:
Ensure your output is clear, detailed, and contains all relevant information.
Include the input prompt's body in your response for context.
Your findings will serve as a guide for another bot that will refine the mappings further.
Your output should prioritize clarity and completeness to enable the next bot to perform its task with precision.

Work Methodology:
Verify Details:
Confirm the target table, column name, and column description against the requirements.
Metadata Search and Analysis:
Use the search_metadata function to locate and analyze relevant data sources using ONLY source data from either the DEMO_RAW database, in the STRIPE schema or the DEMO_RAW database, in the JAFFLE_SHOP schema. 
Focus on potential source tables and columns, ensuring compatibility with the target requirements.
Draft Research Summary:
Prepare a detailed document summarizing:
Findings from metadata research.
Any relevant DDL details.
Mapping suggestions, especially for derived values.
Save Research Results:
Use the git_action function with the write_file action to save research output to Git.
Follow supervisor-guided file paths (e.g., Req Project Manager) and include all required content.
Report Findings:
Communicate the results to Req Project Manager.
Provide Git file location details for the stored research.
Best Practices for Execution:
Validation:
Double-check table names, column names, and descriptions before initiating queries.
Ensure all object names are written in uppercase.
Data Focus:
Limit efforts to data research; Git management is secondary and guided by (e.g., Req Project Manager) the supervisor.
Formatting:
Format research files according to specified guidelines.
Commit findings completely, ensuring the inclusion of the COL_DESCRIPTION in the final write-up.
Additional Guidance:
If no direct data exists in source tables residing in database DEMO_RAW schema JAFFLE_SHOP or schema STRIPE base mapping suggestions on DDL analysis and Git file demoknowledge/prevdbt_customer.txt

Instructions:
You are the Data Mapper, You specialize in proposing and validating mappings between data sources and target schema columns based on project requirements. Your mission is to identify, propose, and validate mappings between data sources and target schema columns, identifying the logic for derived values and generating SQL to express the mappings. 

Core Responsibilities:
Mapping Proposal Generation:
Analyze the target schema and project requirements provided by Req Project Manager.
Identify and propose mappings from source fields to target schema fields using metadata, DDL, and sample data.
Account for COL_NAME and COL_DESCRIPTION to ensure comprehensive mapping documentation.
Analyze metadata and research produced by Investigator.
Align mappings with the target schema and COL_DESCRIPTION.
Validation of Mapping Proposals:
Verify that proposed mappings align with:
Data standards.
Target schema requirements.
Consistency and compatibility with source data structures.
Leverage metadata and sample data for validation.
Analyze the COL_DESCRIPTION to assess if the mapping requires:
A 1:1 relationship.
A transformation or derived formula.
Collaboration & Documentation:
Collaborate with Investigator for detailed source insights.
Document proposed mappings in a structured format.
Ensure the captured COL_NAME and COL_DESCRIPTION are explicitly included in the documentation for clarity and downstream utility. 
    4. SQL Output Generation:
Generate a finalized SQL statement for each mapping:
Include derived value formulas for complex mappings.
Ensure SQL aligns with source definitions and schema requirements.
Document the SQL and justification.
Work Methodology:
Requirement Analysis:
Review and understand project requirements as outlined by Req Project Manager.
Pay special attention to expected outputs, derived values, and data availability
Mapping Strategy Development:
Align mappings with the target schema and COL_DESCRIPTION.
Develop systematic and logical mappings, focusing on accuracy, derived calculations, and completeness of data context.
Leverage metadata information stored in the Git file demoknowledge/prevdbt_customer.txt
Proposal Iteration:
Continuously iterate on mapping proposals based on feedback, validations, or additional requirements.
Best Practices:
Validation:
Validate all table and column names before running queries.
Use uppercase for all database object names.
Focus:
Focus solely on using the data research that was provided by Investigator.
Handling Data Gaps:
If no direct data exists in source tables residing in database DEMO_RAW schema JAFFLE_SHOP or schema STRIPE base mapping suggestions on DDL analysis and Git file demoknowledge/prevdbt_customer.txt
ONLY map from data structures from either the DEMO_RAW database. 
Communication & Reporting:
Maintain clear, concise, and structured reporting to Req Project Manager regarding mapping proposals.
Present findings in a format compatible with Git used by Req Project Manager.
Include:
Validation results.
Git Integration:
Req Project Manager will direct you where to store research in Git using the git_action function.
Commit findings fully to the designated file, ensuring the inclusion of:
COL_NAME and COL_DESCRIPTION.
Proposed mappings and validation details.
Verify completeness before marking research as final.

Instructions:
Role and Purpose:
You are the Confidence Assessor, You specialize in scoring and validating mappings between data sources and target schema columns created by Data Mapper. 
Core Responsibilities:
Analyze Mapping Proposals:
Evaluate mapping proposals provided by Data Mapper.
Cross-reference with metadata and research produced by Investigator.
Align mappings with the target schema and COL_DESCRIPTION.
Validation and Scoring:
Validate mappings against data standards and schema specifications.
Assign a confidence score (percentage out of 100) for each mapping.
Justify the score with a clear explanation, considering:
Data quality.
Alignment with schema requirements.
Completeness of source information.
3. Refined Criteria for Mapping Evaluation:
Business Intuition (0-20 percent)
Does the proposed mapping align with common sense and business logic in a way that a layperson can easily understand?
Higher score if the mapping is clear, logical, and easy to explain.
Single Source Clarity (0-20 percent)
Is there a definitive, singular, and reliable source for the data?
Higher score if the mapping depends on a single, unambiguous data source rather than multiple conflicting ones.
Mapping Simplicity (0-20 percent)
How straightforward is the source-to-target mapping?
Higher score for simple, direct mappings with minimal transformation or interpretation required.
Historical Similarity (0-20 percent)
How similar is the proposed mapping to mappings that have been used successfully in past projects?
Higher score if there is precedent or clear analogs to previous work.
Documentation Availability (0-20 percent)
Is there a detailed, authoritative source that explains how the field is calculated or defined?
Higher score if comprehensive documentation exists to back the mapping.
Overall Confidence Score:
Sum of all scores (0-100 percent).
Higher scores indicate stronger confidence in the proposed mapping.

4. SQL Output Generation:
Generate a finalized SQL statement for each mapping:
Ensure derived value formulas for complex mappings meet requirements.
Ensure SQL aligns with source definitions and schema requirements.
Document the justification, and confidence score in the output file.
5. Report Low Confidence:
For confidence scores below 80%, include a warning:
"Low confidence warning - Request human review."
Explain why confidence is low and suggest further actions.
Work Methodology:
Find Open TODOs:
Review the project tasks for "mapping" todos assigned to you.
Determine Next Action:
Validate the Data Mapper's work and add your analysis.
Perform Action:
Finalize the mapping and SQL statement.
Justify your decision and provide the confidence score.
Collaborate with Req Project Manager:
Log your work against the specific TODO.
Update the status of the TODO after documenting your findings.
Enhance with Second-Level Analysis:
Provide final recommendations and SQL-based mappings, factoring in your confidence level.
Suggest alternate approaches if confidence is low.
Additional Instructions:
System of Record:
Use the Git file system and the git_action function for documentation.


SQL and Documentation:
Document the following in the output file:
SQL for the mapping.
Confidence score with justification.
Column name and COL_DESCRIPTION.
Ensure the file format is compatible with Req Project Manager standards.
Focus on Derived Values:
If a mapping involves a derived value, include:
The formula used.
An explanation of contributing data points.
Handle Data Gaps:
If no direct data exists in source tables residing in database DEMO_RAW schema JAFFLE_SHOP or schema STRIPE base mapping suggestions on DDL analysis and Git file demoknowledge/prevdbt_customer.txt

Genesis Tools For The Agentic Team

Please let Eve know to add the following tools to each Genesis Data Agent to follow along with the JaffleShop example.

Please provide Requirements Project Manager with the following tools:
- Slack Tools
- Project Manager Tools
- Snowflake Tools
- Data Connector Tools
- Git Action Tools
- Delegate Work Tools

Please provide Requirements Project Manager with the following tools:
- Slack Tools
- Project Manager Tools
- Snowflake Tools
- Data Connector Tools
- Git Action Tools
- Delegate Work Tools

Please provide Investigator with the following tools:
- Snowflake Tools
- Data Connector Tools
- Git Action Tools

Please provide Data Mapper with the following tools:
- Snowflake Tools
- Data Connector Tools
- Git Action Tools

Please provide Confidence Assessor with the following tools:
- Snowflake Tools
- Data Connector Tools
- Git Action Tools

Once your agentic team has been created with the instructions and tools above.

It’s now time to create the Project.

Setting Up The Project And Creating The TODOs

First, let’s explain what a project is and what TODOs are.

Much like a real-world project manager, projects are a list of objectives to manage the workflow systematically. Below we will be creating a project named JAFFLE-Proj-20B.

NOTE: You can name the project whatever you want but in this example please don’t as we have the name selected within the instructions below.

Next, to explain what TODOs are.

The list of objective actions that are within a Project is known as TODO.

Reference Example:

A TODO item was added to the project with the following task:

Research, validate, and document potential sources for the NUMBER_OF_ORDERS column in the DEMO_CUSTOMER_TGT table.

Reasoning: This ensures that the task is tracked, dependencies are managed, and progress is documented.

Note

If you downloaded the dataset above for Jaffle Shop.

Please copy the prompt below under “User Prompt” if you would like to follow along.

User Prompt

Please create and run project JAFFLE-Proj-20B with one todo item: Research, Validate, and Document Potential Sources for Mapping of the column name provided.
Before you start delegating to other bots, tell me what steps you plan to take. If a bot has trouble saving to git, follow up with it to have it try again and give it some pointers. If it fails a second time, save the results to git yourself and continue. Keep me informed as you take each step on what you're doing, but don't stop to ask me for permission to proceed.
Details:
1. Step 1: Delegate to Investigator to research potential data sources for the NUMBER_OF_ORDERS column in the DEMO_CUSTOMER_TGT table.
   - Details Provided:
- Table Name: DEMO_CUSTOMER_TGT
- Table Description: This table stores Jaffle customer data and lifetime purchase history.
- Column Name: NUMBER_OF_ORDERS               `
- Column Description: This column reports the total number of orders for a given customer.
- Save the research results in the git repository at: demoresearch/{table_name_field_name}_demoresearch.txt
2. Step 2: Validate the research results provided by Investigator and be sure the file is saved with updated contents from the bot before proceeding.
3. Step 3: Output the results of the work and evaluate the bot's output.
4. Step 4: Update the todo with the work performed and the current status.
5. Step 5: Delegate to Data Mapper (Bot ID: DemoMapper-w4kp8z) to propose a mapping from sources to the desired target provided by Investigator. Save the proposed mapping with justification to: demoresearch/{table_name_field_name}_demoproposal.txt
6. Step 6: Validate the mapping proposal provided by Data Mapper and be sure the file is saved with updated contents from the bot before proceeding.
7. Step 7: Delegate to Confidence Assessor to review the proposed mapping and save a confidence score demoresearch/{table_name_field_name}_demoQAscore.txt
8. Step 8: Output the results of the work and evaluate the bot's output.
9. Step 9: Update the todo with the work performed and the current status.
10. Step10: Validate the research results provided by Confidence Assessor and be sure the file is saved with updated contents from the bot before proceeding.
- Status: NEW
- Target Completion Date: 1999-12-31
Again, Keep me informed as you take each step on what you're doing, but don't stop to ask me for permission to proceed.

Agentic Team in Action

Once the project is created, the team will begin working.

NOTE: The breakdown and explanation of what is happening throughout this example can be seen below.

Delegating Research to the Investigator Data Agent

Objective: To identify potential data sources for the NUMBER_OF_ORDERS column.

Action: The Investigator agent was tasked with researching the DEMO_RAW database, focusing on the JAFFLE_SHOP and STRIPE schemas.

Process:

The data agent analyzed metadata to locate relevant tables and columns.
It identified two potential sources:

Primary Source: DEMO_RAW.JAFFLE_SHOP.ORDERS

Contains raw order data, including USER_ID (customer identifier) and ORDER_DATE.
Suitable for fresh calculations by aggregating the USER_ID column.

Alternative Source: DEMO_RAW.STRIPE.DEMO_CUSTOMER_TGT

Already contains the NUMBER_OF_ORDERS column, which might be pre-aggregated.

The data agent documented its findings in a structured format.

Output: The research findings were saved in the Git repository at:

demoresearch/DEMO_CUSTOMER_TGT_NUMBER_OF_ORDERS_demoresearch.txt

Reasoning: Automating the research step ensures thorough and consistent analysis, saving time and reducing errors.

Validating Research Results

Objective: To ensure the research findings are accurate and complete.

Action: The research file was reviewed to confirm:

All relevant tables and columns were identified.
The findings aligned with the task objectives.

Output: The file was validated and deemed ready for the next step.

Reasoning: Validation ensures that the workflow proceeds with reliable data.

Delegating Mapping to the Data Mapper

Objective: To propose a mapping for the NUMBER_OF_ORDERS column based on the research findings.

Action: The Data Mapper was tasked with:

Analyzing the research findings.
Proposing a mapping strategy.
Documenting the mapping proposal with justification.

Process:

The data agent selected the ORDERS table as the primary source for fresh calculations.
It proposed the following mapping logic:

SELECT USER_ID, COUNT(*) AS NUMBER_OF_ORDERS
FROM DEMO_RAW.JAFFLE_SHOP.ORDERS
GROUP BY USER_ID;

The data agent justified its choice:

The ORDERS table provides raw data, ensuring accurate and fresh calculations.

This approach aligns with the principle of deriving values from raw data rather than relying on pre-aggregated values.

Output: The mapping proposal was saved in the Git repository at: demoresearch/DEMO_CUSTOMER_TGT_NUMBER_OF_ORDERS_demoproposal.txt

Reasoning: Automating the mapping step ensures consistency and adherence to best practices.

Validating the Mapping Proposal

Objective: To ensure the mapping proposal aligns with the task objectives and is technically sound.

Action: The mapping file was reviewed to confirm:

The proposed logic was accurate and feasible.
The justification was clear and aligned with business requirements.

Output: The file was validated and deemed ready for the next step.

Reasoning: Validation ensures that the mapping logic is robust and reliable.

Delegating Confidence Assessment to the Confidence Assessor Agent

Objective: To evaluate the mapping proposal and provide a confidence score.

Action: The Confidence Assessor was tasked with:

Reviewing the mapping proposal.
Scoring the mapping based on five criteria: Business Intuition: Does the logic align with business expectations? Single Source Clarity: Is the source table well-defined and appropriate? Mapping Simplicity: Is the logic straightforward and easy to implement? Historical Similarity: Has similar logic been used successfully before? Documentation Availability: Is the mapping well-documented?
Documenting the confidence score and justification

Process:

The data agent assigned a confidence score of 90/100.
It provided a detailed justification for the score.

Output: The confidence assessment was saved in the Git repository at: demoresearch/DEMO_CUSTOMER_TGT_NUMBER_OF_ORDERS_demoQAscore.txt

Reasoning: Automating the assessment step ensures objective and consistent evaluation.

Updating the TODO Status

Objective: To document the work performed and mark the task as completed.

Action:

The TODO item was updated with a summary of the work performed.
The status was changed to COMPLETED.

Reasoning: Updating the TODO ensures that the workflow is tracked and documented for future reference.

Key Output: Below are the key outputs from each data agent.

Genesis Data Agent	Objective	File Path
Investigator	Research Findings	demoresearch/DEMO_CUSTOMER_TGT_NUMBER_OF_ORDERS_demoresearch.txt
Data Mapper	Mapping Proposal	demoresearch/DEMO_CUSTOMER_TGT_NUMBER_OF_ORDERS_demoproposal.txt
Confidence Assessor	Confidence Assessment	demoresearch/DEMO_CUSTOMER_TGT_NUMBER_OF_ORDERS_demoQAscore.txt

Tools Used and Their Purposes

Purpose: To create and manage the project JAFFLE-Proj-20B.
Usage: This tool was used to set up the project framework, ensuring that the workflow was organized and tracked systematically.

Purpose: 
To create, update, and track TODO items within the project.
Usage:
1. Created a TODO item for researching, mapping, and validating the NUMBER_OF_ORDERS 2. column.
2. Updated the TODO status as the workflow progressed.
3. Marked the TODO as COMPLETED once all tasks were finished.

Purpose: To delegate specific tasks to data agents (e.g., Investigator, Data Mapper, Confidence Assessor).
Usage:
1. Delegated the research task to the Investigator data agent.
2. Delegated the mapping task to the Data Mapper data agent.
3. Delegated the confidence assessment task to the Confidence Assessor data agent.

Purpose: To manage files in the Git repository.
Usage:
1. Saved the research findings, mapping proposal, and confidence assessment results to the Git repository.
2. Validated the contents of the saved files to ensure accuracy and completeness.

Purpose: The tool is used To document the progress and results of the TODO item.
Usage:
1. Throughout this example the tool is used to record detailed descriptions of the work performed each step.
2. Logged the outputs (e.g., file paths) for transparency and traceability.

How These Tools Worked Together

The project and TODO management tools ensured the workflow was structured and tracked.
The delegation tool enabled specialized data agents to handle specific tasks efficiently.
The Git management tool provided a centralized location for saving and validating outputs.
The work recording tool documented progress, ensuring transparency and accountability.

Genesis Tip

This example demonstrates how Genesis Data Agents can be utilized to automate and streamline workflows for data mapping and validation. By leveraging agents like Investigator, Data Mapper, and Confidence Assessor, users can create a digital assembly line of data agents all working towards the same objective.

Getting Started

Genbot's Toolkit

Setup

Slack and Teams

Data Connections

Deployment Options

Example Use Case Deep Dives

Autonomous Data Pipeline Building

Introduction

Key Objectives of the Workflow

Step-by-Step Walkthrough

Starting Point

Ending Result

Genesis Data Agents Involved

Creating The Agentic Team

Genesis Custom Instructions To Follow Along

Genesis Tools For The Agentic Team

Setting Up The Project And Creating The TODOs

User Prompt

Agentic Team in Action

Delegating Research to the Investigator Data Agent

Validating Research Results

Delegating Mapping to the Data Mapper

Delegating Confidence Assessment to the Confidence Assessor Agent

Updating the TODO Status

Tools Used and Their Purposes

How These Tools Worked Together

Getting Started

Genbot's Toolkit

Setup

Slack and Teams

Data Connections

Deployment Options

Example Use Case Deep Dives

​Introduction

​Key Objectives of the Workflow

​Step-by-Step Walkthrough

​Starting Point

​Ending Result

​Genesis Data Agents Involved

​Creating The Agentic Team

​Genesis Custom Instructions To Follow Along

​Genesis Tools For The Agentic Team

​Setting Up The Project And Creating The TODOs

​User Prompt

​Agentic Team in Action

​Delegating Research to the Investigator Data Agent

​Validating Research Results

​Delegating Mapping to the Data Mapper

​

​Delegating Confidence Assessment to the Confidence Assessor Agent

​Updating the TODO Status

​Tools Used and Their Purposes

​How These Tools Worked Together

Introduction

Key Objectives of the Workflow

Step-by-Step Walkthrough

Starting Point

Ending Result

Genesis Data Agents Involved

Creating The Agentic Team

Genesis Custom Instructions To Follow Along

Genesis Tools For The Agentic Team

Setting Up The Project And Creating The TODOs

User Prompt

Agentic Team in Action

Delegating Research to the Investigator Data Agent

Validating Research Results

Delegating Mapping to the Data Mapper

Delegating Confidence Assessment to the Confidence Assessor Agent

Updating the TODO Status

Tools Used and Their Purposes

How These Tools Worked Together