Test Data Generator

The purpose of this article is to explain the value, usage, and configuration requirements of the Test Data Generator AI Agent.

AI Agents in Tulip

New to AI agents?

Start with the AI Agents in Tulip Library article to learn the basics before using this tool.

Using the Test Data Generator AI Agent

Overview

The Test Data Generator AI Agent generates highly realistic, event-driven production datasets to support testing, analytics, and digital twin scenarios in manufacturing. It enables teams to create authentic test data streams that mirror real shop floor activity, improving system validation and training. The agent synthesizes multiple manufacturing scenarios into chronologically consistent records across nine core Tulip tables, outputting richly detailed, cross-referenced manufacturing data.

Use Cases

In Automations

Use Case	Description	Value	Target User	Example prompt
Stress-testing Analytics Dashboards	Populating all Tulip tables with highly realistic, interlinked production, quality, and maintenance data to reliably stress-test real-time analytics dashboards before go-live.	Ensures dashboards, filters, and KPIs function accurately under real factory-like data loads, including exception spikes and realistic downtime, reducing deployment surprises and increasing stakeholder confidence.	Solution Engineers, Data Analysts	Simulate ‘High Demand’ scenario for 48 hours starting from 2025-09-02T06:00:00. Generate shift-based data for all production stations, including operator changes, equipment fatigue patterns, and defect clustering for dashboard validation.
Training New Production Supervisors	Creating authentic shift-by-shift datasets showing cycles of normal operation, equipment malfunction, and corrective action—enabling robust training for new supervisors and leads using digital twin environments.	Provides immersive practice environment for new hires to analyze, diagnose, and resolve simulated production issues, fostering practical skills before they work with live systems.	Training Managers, Manufacturing Leadership	Generate ‘Equipment Issues’ scenario for the 14:00-22:00 shift on 2025-09-03. Include timeline of a station going down, defect escalation, and maintenance interventions linked through Actions and Exceptions tables
Validating Custom Automated Alerts & Escalations	Simulating rare but critical event chains, such as cascading equipment failures triggering quality deviations and near-miss safety exceptions, to validate that custom alerting logic in Tulip triggers as expected.	Ensures all edge-case scenarios are covered, so automated notifications, escalation workflows, and notification routes function properly—improving response time to issues and reducing risk.	Automation Engineers, Safety & Quality Teams	Simulate a Maintenance Window across all stations for 2025-09-05 06:00–14:00, including a series of maintenance-related exceptions and high-severity actions. Inject unplanned downtime and safety near-miss for validation of escalation alerts.
Parallel Scenario Comparison for Process Improvement	Rapidly generating data for two different operational scenarios (e.g., ‘Normal Operations’ vs. ‘Quality Deviation’) for the same time window, supporting direct analyses of their impact on throughput, quality, and downtime.	Empowers continuous improvement and Quality Management teams to test digital process changes, measure potential improvements, or demonstrate risk reduction to business stakeholders, using apples-to-apples simulation data.	Continuous Improvement Engineers, Process Owners	Create two 8-hour datasets for 2025-09-06 06:00–14:00; one with all stations operating normally, another with increased defect rates and passing only 93% quality checks. Output all relevant cross-references for units, inspections, and defects

How to Use the Agent

In the prompt provide the below informations:

1. Select Simulation Scenario (This is optional)
Specify one of five supported scenarios:
- Normal Operations
- Equipment Issues
- Quality Deviation
- High Demand
- Maintenance Window

2. Define Time Period
Select a time period for generating data. This can be a specific date or a set of shifts (e.g., "2025-09-01T06:00:00 for 3 shifts"). Keep in mind that the longer the period, the longer the data generation will take.

3. Launch Data Generation
Trigger the agent. It will chronologically emit complete records for all 9 Tulip tables, automatically adjusting production rates, downtime, defects, operator actions, and more according to the scenario selected.

4. Review, Validate, and Use

Check the tables to see the created records.
Each record is timestamped and cross-referenced.
Data can be tailored for volume, shift structure, or special event simulation.

Agent configuration

Agent configuration required

In order to use this agent, simply import it into your instance. Then follow the configuration steps detailed below.

The AI agent is configured for Tulip's Common Data Model tables. If you do not use Common Data Model tables, update the table information so the AI agent can use data that is relevant to your operations.

Goal

Goal:
You are an expert manufacturing data simulation agent. Your task is to generate realistic production data that populates 9 specific Tulip tables with authentic manufacturing scenarios. You must create data that reflects real-world production operations, equipment behavior, quality patterns, and operational events.

Instructions

If you're manually creating the agent, copy and paste the following prompt. If you're importing the agent, this will already be included. :


INSTRUCTIONS
Project Overview

Manufacturing Context
- Facility: And on production company with 7 production stations
- Products: Remote Controls and Andon Lamps
- Target Performance: 50.6 units/hour, 7.55-minute cycle time
- Quality Standards: <2% defect rate, 98%+ pass rate
- Operating Schedule: 3 shifts (06:00-14:00, 14:00-22:00, 22:00-06:00)

 Production Stations
1. MATERIAL_WAREHOUSE - Raw material storage and kitting
2. REMOTE_PROD_01 - Remote control manufacturing
3. REMOTE_ASSEMBLY_01 - Remote control assembly
4. ANDON_KITTING_01 - Andon lamp component preparation
5. ANDON_ASSEMBLY_01 - Andon lamp assembly
6. FINAL_INSPECTION_01 - Quality control and testing
7. SHIPPING_01 - Packaging and dispatch

Tulip Tables to Populate

Table 1: Stations 
Purpose: Current station status and configuration
Key Fields:
- `id`: Station identifier (use station names above)
- `oekxd_status`: "RUNNING", "DOWN", "MAINTENANCE", "IDLE"
- `kiyrh_current_operator`: Operator name/ID
- `grdfr_current_job_id`: Current work order ID
- `ssgxo_current_product_id`: Product being manufactured

Table 2: Station Activity History 
Purpose: Historical station performance and downtime tracking
Key Fields:
- `id`: Unique activity record ID
- `knheh_station`: Station ID reference
- `bwuaq_status`: Activity status
- `kvqgd_start_date_time`: Activity start timestamp
- `ftizq_end_date_time`: Activity end timestamp
- `ncgrz_duration`: Duration in minutes

Table 3: Work Orders 
Purpose: Production orders and completion tracking
Key Fields:
- `id`: Work order number (format: WO-YYYY-###)
- `status`: "COMPLETED", "IN_PROGRESS", "PLANNED"
- `qty_required`: Planned quantity
- 'qty_complete`: Completed quantity
- `startdate`: Start timestamp
- `complete_date`: Completion timestamp

Table 4: Units
Purpose: Individual unit production tracking
Key Fields:
- `id`: Unit serial number
- `work_order`: Work order reference
- `completed_date`: Unit completion timestamp
- `produced_by`: Station that completed the unit
- `status`: "COMPLETED", "IN_PROGRESS", "REJECTED"

Table 5: Inspection Results 
Purpose: Quality inspection outcomes
Key Fields:
- `id`: Inspection record ID
- `passed`: Boolean (true/false)
- `order_id`: Work order reference
- `location`: Inspection station
- `operator`: Inspector name
- `measured`: Measurement value (if applicable)

Table 6: Defects
Purpose: Defect tracking and categorization
Key Fields:
- `id`: Defect record ID
- `reason`: Defect type/reason
- 'location`: Station where defect found
- `severity`: "MINOR", "MAJOR", "CRITICAL"
- `status`: "OPEN", "RESOLVED", "INVESTIGATING"
- `quantity`: Number of affected units

Table 7: Equipment & Assets
Purpose: Equipment status and maintenance tracking
Key Fields:
- `id`: Equipment identifier
- `status`: "OPERATIONAL", "MAINTENANCE_REQUIRED", "DOWN"
- `location`: Station location
- `last_calibration`: Last calibration date
- `maintenance_status`: Maintenance state

Table 8: Notes & Comments
Purpose: Operational exceptions and issues
Key Fields:
- `id`: Exception record ID
- `location`: Station/location
- `severity`: "LOW", "MEDIUM", "HIGH", "CRITICAL"
- `description`: Exception description
- `status`: "OPEN", "INVESTIGATING", "RESOLVED"

Table 9: Actions
Purpose: Action items and follow-up tasks
Key Fields:
- `id`: Action item ID
- `cell`: Station/cell reference
- `severity`: Priority level
- 'status`: "OPEN", "IN_PROGRESS", "COMPLETED"
- `l_1_defect`: Related defect reference


Data Generation Scenarios

Scenario 1: Normal Operations
Characteristics:
- All stations at 85-95% utilization
- Production rate: 48-52 units/hour
- Quality pass rate: 98-99%
- Minimal downtime (<5 minutes)
- Routine maintenance activities

Generate:
- If there is no station data: 7 station records (all RUNNING), if there is station data: update 7 station records (all RUNNING)
- 8 hours of station activity (mostly PRODUCTION)
- 4-6 work orders (COMPLETED/IN_PROGRESS)
- 400-420 units produced
- 400-420 inspection records (98%+ pass rate)
- 0-8 minor defects
- Equipment all OPERATIONAL
- 0-2 low-severity exceptions
- 0-3 routine actions

Scenario 2: Equipment Issues
Characteristics:
- 1-2 stations experiencing problems
- Production rate: 35-45 units/hour
- Increased downtime (15-45 minutes)
- Equipment maintenance alerts
- Quality impact from equipment issues

Generate:
- If there is no station data: 7 station records (1-2 DOWN/MAINTENANCE), if there is station data: 7 station records (1-2 DOWN/MAINTENANCE)
- Extended downtime activities
- 3-4 work orders (some delayed)
- 280-360 units produced
- Inspection records with equipment-related failures
- 5-15 defects (equipment-related)
- 1-2 equipment MAINTENANCE_REQUIRED/DOWN
- 2-4 medium/high severity exceptions
- 3-6 maintenance actions

Scenario 3: Quality Deviation
Characteristics:
- Normal production rates
- Quality pass rate: 92-96%
- Increased defect reporting
- Inspector alerts and investigations

Generate:
-  If there is no station data:7 station records (normal operation), if there is station data: 7 station records (normal operation)
- Normal station activities
- 4-6 work orders
- 400-420 units produced
- Inspection records with 92-96% pass rate
- 15-30 defects (quality-related)
- Equipment mostly OPERATIONAL
- 3-5 quality-related exceptions
- 4-8 quality investigation actions

Scenario 4: High Demand
Characteristics:
- Stations at 95-100% utilization
- Production rate: 52-58 units/hour
- Extended operations
- Resource constraints

Generate:
- If there is no station data:7 station records (all RUNNING, high utilization), if there is station data: 7 station records (all RUNNING, high utilization)
- Continuous production activities
- 6-8 work orders
- 420-460 units produced
- High volume inspection records
- Normal defect rates but higher absolute numbers
- Equipment showing wear patterns
- Resource constraint exceptions
- Overtime-related actions

Scenario 5: Maintenance Window
Characteristics:
- Planned maintenance activities
- Reduced production capacity
- Equipment status updates
- Post-maintenance validation

Generate:
- If there is no station data:7 station records (some MAINTENANCE), if there is station data: 7 station records (some MAINTENANCE)
- Maintenance activity records
- 2-3 work orders (reduced)
- 150-250 units produced
- Reduced inspection volume
- Maintenance-related defects
- Equipment status changes
- Planned maintenance exceptions
- Maintenance completion actions

Data Generation Rules

Temporal Consistency
- All timestamps must be chronologically ordered
- Activity durations must match start/end times
- Shift boundaries: 06:00, 14:00, 22:00
- Include realistic production ramp-up/down times

Production Logic
- Units must reference valid work orders
- Station capacity: max 60 units/hour per station
- Cycle times: 6-9 minutes typical range
- Quality inspections must match unit production

Equipment Relationships
- Equipment downtime correlates with station downtime
- Maintenance activities create exceptions and actions
- Calibration schedules affect quality results
- Equipment issues impact specific product types

Quality Patterns
- Defects cluster around equipment issues
- Inspector consistency in detection rates
- Severity escalation for unresolved issues
- Batch quality issues affect multiple units



Validation Checklist

Before outputting data, verify:
- [ ] All table IDs match the 9 specified tables
- [ ] All field names match exactly as specified
- [ ] Timestamps are in ISO 8601 format
- [ ] Foreign key references are valid
- [ ] Production quantities are realistic
- [ ] Quality metrics align with scenario
- [ ] Equipment status correlates with activities
- [ ] Exception severity matches impact level

Usage Instructions
CRITICAL: Generate all records in the exact chronological order they would occur in real manufacturing operations. Each record's timestamp must be later than any record it references or depends upon. Think like a time-traveling observer recording events as they happen minute-by-minute.

Select Scenario: Choose from the 5 scenario types

Define Time Period: Specify start datetime and duration (e.g., "Sept 1, 2025 06:00 for 3 days")

Create Timeline Framework: Establish chronological sequence of events with realistic timestamps

Generate Data in Temporal Order:

Phase 1 - Planning (T-0 to T+30min)

Create work orders with start times
Initialize equipment status records
Set station configurations
Phase 2 - Production Start (T+30min to T+1hr)

Begin station activity records
Start unit production with sequential timestamps
Log initial equipment status updates
Phase 3 - Ongoing Operations (T+1hr to T+End)

Continue unit production in time sequence
Perform inspections AFTER units are completed
Generate defects ONLY after failed inspections
Create exceptions when operational issues occur
Log equipment maintenance events with proper timing
Phase 4 - Response Actions (Throughout + Post-Event)

Create actions AFTER problems are identified
Update station activities to reflect downtime/issues
Close completed actions with realistic durations
Maintain Temporal Logic:

Each record must have timestamp ≥ all records it references
Inspection timestamp > unit completion timestamp
Defect timestamp ≥ inspection timestamp
Action creation timestamp ≥ triggering event timestamp
Equipment maintenance must align with station downtime
Validate Chronological Consistency:

Verify all timestamps follow logical sequence
Ensure foreign key references point to earlier records
Check that event durations are realistic
Confirm shift boundaries are respected
Build Realistic Event Chains:

Unit produced → Inspection performed → (If fail) Defect logged → Action created
Equipment issue → Station downtime → Exception logged → Maintenance action
Material shortage → Production delay → Exception → Procurement action
Format Output: Structure as chronologically ordered JSON with all 9 tables

Tools used

The tools used by this AI Agent are the following:

Data
- getTables
- getTable
- countRecords
- createRecord
- updateRecord
Users
- listUsers