Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 2 additions & 0 deletions docs/_static/customize.css
Original file line number Diff line number Diff line change
@@ -1,2 +1,4 @@
.admonition.pattern-metadata { border-left-color: #f0ad4e; }
.admonition.paper-reference { border-left-color: #5bc0de; }

.scrollable pre { max-height: 400px; overflow-y: auto; }
11 changes: 11 additions & 0 deletions docs/index.md
Original file line number Diff line number Diff line change
Expand Up @@ -171,6 +171,17 @@ nemotron/embed/README.md
nemotron/artifacts.md
```

```{toctree}
:caption: Synthetic Data Generation
:hidden:

About <sdg/index>
Getting Started <sdg/getting-started>
Tips for Using Agents <sdg/using-skills>
Tasks <sdg/how-to/index>
Reference <sdg/reference/index>
```

```{toctree}
:caption: Customization
:hidden:
Expand Down
75 changes: 75 additions & 0 deletions docs/sdg/_snippets/input/greenteme.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,75 @@
output_dir: ${oc.env:SDG_OUTPUT_DIR,${oc.env:NEMO_RUN_DIR,${oc.env:PWD}/output}/sdg}
output_path: ${output_dir}/greenteme_sft.jsonl
num_records: 100

seed_dataset:
path: ${oc.env:PWD}/src/nemotron/steps/sdg/data_designer/data/greenteme_inquiry_seeds.jsonl
strategy: shuffle
fields: [scenario]

models:
- alias: nvidia-text
model: nvidia/nemotron-3-nano-30b-a3b
provider: nvidia
skip_health_check: true
inference_parameters:
temperature: 0.8
top_p: 1.0
max_tokens: 1200

columns:
- name: traveler_segment
type: category
values:
- frequent_flyer
- business_traveler
- family_with_children
- first_time_international
- elite_loyalty_member
- leisure_couple

- name: inquiry_type
type: category
values:
- rebooking
- baggage_issue
- refund_request
- loyalty_status
- fare_rules
- flight_status

- name: channel
type: category
values: [chat, phone, app]

- name: user_query
type: llm_text
model_alias: nvidia-text
prompt: |
You are role-playing a {{ traveler_segment }} contacting Greenteme Airlines
via {{ channel }} about a {{ inquiry_type }}. The scenario is:
"{{ scenario }}"

Write the customer's first message. Keep it natural, 1-3 sentences.
Do not reference any real airline name, real flight number, or real
loyalty program.

- name: assistant_response
type: llm_text
model_alias: nvidia-text
prompt: |
You are a customer-service agent at Greenteme Airlines, a fictional airline.
Reply to this customer message:

"{{ user_query }}"

Provide a concise, professional, compliant response, 2-4 sentences. Stay
realistic and grounded in standard airline policy. Do not invent real
airline names, real flight numbers, real PNR codes, or real loyalty
program details. No markdown.

output_projection:
type: openai_messages
user_field: user_query
assistant_field: assistant_response
metadata_fields: [traveler_segment, inquiry_type, channel, scenario]
12 changes: 12 additions & 0 deletions docs/sdg/_snippets/input/greenteme_inquiry_seeds.jsonl
Original file line number Diff line number Diff line change
@@ -0,0 +1,12 @@
{"scenario": "Connecting flight cancelled due to weather; customer needs to arrive at destination by tomorrow morning for a wedding."}
{"scenario": "Checked baggage missing on arrival; flight landed two hours ago and the bag did not appear at the carousel."}
{"scenario": "Customer wants a refund on a non-refundable ticket due to a documented medical emergency."}
{"scenario": "Customer is unsure why their loyalty status was downgraded this year and wants to understand the qualifying criteria."}
{"scenario": "Customer wants to change a fare class on an existing booking and needs to know the fare difference and any change fees."}
{"scenario": "Flight is showing a four-hour delay and the customer wants to know whether they will make their connection."}
{"scenario": "Customer was double-charged for a seat upgrade and wants the duplicate charge reversed."}
{"scenario": "Customer needs to add a service animal to an upcoming international flight and wants to know what documentation is required."}
{"scenario": "Bag damaged in transit; customer needs to file a claim and wants the timeline and required documentation."}
{"scenario": "Customer rebooked through self-service and is now seated apart from a travel companion; they want to be reseated together."}
{"scenario": "Customer wants to use a travel credit from a previous cancellation but cannot find the credit number in their account."}
{"scenario": "Customer's payment method was declined when trying to complete a booking and they want to know what to do."}
Loading
Loading