--- name: example-sentences description: Requirements for creating and revising example sentences in je-dict-1. Covers minimum counts, progressive length, vocabulary restrictions by tier, and quality standards. --- # Example Sentence Guidelines This skill defines the standards for example sentences across all dictionary entries. All entry-creation and revision activities must follow these guidelines. --- ## Minimum Example Counts by Tier ### Basic and Core Tiers Every sense of every entry in the basic and core tiers must have **at least 5 example sentences**. | Tier | Minimum Examples per Sense | |------|---------------------------| | Basic | 5 | | Core | 5 | ### General Tier Every sense of every entry in the general tier must have **at least 3 example sentences**. | Tier | Minimum Examples per Sense | |------|---------------------------| | General | 3 | **Multi-sense entries:** The requirements apply per sense. An entry with 3 senses in the basic tier needs at least 15 examples total (5 per sense). --- ## Progressive Length Requirement Examples within each sense should progress from shorter to longer: 1. **Example 1**: Short and simple - demonstrates the word clearly 2. **Example 2**: Slightly longer - shows basic context 3. **Example 3**: Medium length - natural usage with fuller context 4. **Example 4** (basic/core): Longer sentence with more complex structure 5. **Example 5** (basic/core): Longest - may consist of 2-3 short sentences or one longer sentence with relative clauses, compound structures, etc. **Key principle:** Early examples help learners recognize the word; later examples show how the word functions in natural, flowing Japanese. ### Length Guidelines | Example | Target Length (Japanese) | |---------|-------------------------| | 1 | 5-15 characters | | 2 | 10-20 characters | | 3 | 15-30 characters | | 4 | 25-45 characters | | 5+ | 35-70 characters (or 2-3 short sentences) | These are guidelines, not strict rules. Natural expression takes priority over hitting exact character counts. --- ## Vocabulary Restrictions by Tier ### Basic Tier Entries For entries in the **basic** vocabulary tier: | Examples | Vocabulary Restriction | |----------|----------------------| | 1-2 | **Basic tier only** - Use only words from the basic tier | | 3-5 | **Basic + Core tiers** - May use basic and core vocabulary | **Critical:** Basic tier examples must contain **no general-tier or unlisted vocabulary**. This ensures learners at the basic level can fully understand all examples without encountering unknown words. ### Core Tier Entries For entries in the **core** vocabulary tier: | Examples | Vocabulary Restriction | |----------|----------------------| | 1-2 | **Basic + Core tiers** - Use only basic or core vocabulary | | 3-5 | **No restriction** - May use any vocabulary | **Recommendation:** Even for unrestricted examples (3-5), avoid vocabulary that is not in the dictionary. Learners should be able to look up any unfamiliar word. ### General Tier Entries For entries in the **general** vocabulary tier: | Examples | Vocabulary Restriction | |----------|----------------------| | 1-3+ | **No restriction** - May use any vocabulary | **Recommendation:** Avoid obscure vocabulary not in the dictionary. Use common, natural expressions that learners are likely to encounter. --- ## Vocabulary Tier Verification When creating examples for basic and core tier entries, verify vocabulary tier compliance: ```bash # Check if a word is in the dictionary and its tier python3 build/check_duplicate.py "word" "reading" ``` For systematic verification during polishing: - Read the example sentence - Identify all content words (nouns, verbs, adjectives, adverbs) - Verify each word is in the appropriate tier - Replace non-compliant words with tier-appropriate alternatives **Common substitution patterns:** | Non-compliant | Basic alternative | Core alternative | |--------------|-------------------|------------------| | {購入|こうにゅう}する | {買|か}う | {買|か}う | | {使用|しよう}する | {使|つか}う | {使|つか}う | | {非常|ひじょう}に | とても | とても/{大変|たいへん} | | {困難|こんなん} | {難|むずか}しい | {難|むずか}しい | --- ## Sense Numbers Requirement Every example **must** include a `sense_numbers` field linking it to the definition(s) it illustrates. ```json "examples": [ { "id": "00001_word_ex1", "japanese": "...", "english": "...", "sense_numbers": [1], "has_audio": false, "notes": null } ] ``` ### Rules - **Single-sense entries**: Use `[1]` for all examples - **Multi-sense entries**: Each example must specify which sense(s) it demonstrates - **Examples illustrating multiple senses**: Use `[1, 2]` format - **Must reference valid senses**: Numbers must match `sense_number` values in definitions ### Distribution for Multi-Sense Entries For entries with multiple senses in basic/core tiers: - Each sense needs at least 5 examples - Distribute examples to cover all senses adequately - Some examples may illustrate multiple senses (use `[1, 2]` notation) --- ## Quality Standards ### Essential Requirements 1. **Natural Japanese**: Examples should sound like something a native speaker would say 2. **Clear context**: The usage should be understandable from the example 3. **Complete furigana**: All kanji must have `{kanji|reading}` markup 4. **Accurate translation**: English captures meaning without being overly literal ### Content Quality 1. **First example should be simple** - Demonstrate the word clearly without complexity 2. **Progress to natural complexity** - Later examples show real-world usage 3. **Include fixed phrases** - High-frequency collocations aid memory (at least one per sense) 4. **Show grammatical connections** - Demonstrate how words connect with particles, etc. 5. **Annotate non-obvious grammar** - Use `notes` field for explanations when needed ### What to Demonstrate Examples should illustrate: - Common collocations and set phrases - Typical particle patterns (for verbs) - Predicate vs. modifier usage (for adjectives) - Register-appropriate contexts - Both literal and figurative meanings (if applicable) --- ## Example Format Reference ```json { "id": "00396_taberu_ex1", "japanese": "{朝|あさ}ごはんを{食|た}べる。", "english": "To eat breakfast.", "sense_numbers": [1], "has_audio": false, "notes": null } ``` ### ID Format Pattern: `{entry_id}_ex{N}` - `entry_id`: The entry's ID (e.g., `00396_taberu`) - `N`: Sequential number starting from 1 ### Fields | Field | Required | Description | |-------|----------|-------------| | `id` | Yes | Unique example ID | | `japanese` | Yes | Japanese sentence with furigana | | `english` | Yes | Natural English translation | | `sense_numbers` | Yes | Array of sense numbers illustrated | | `has_audio` | Yes | Boolean (usually `false`) | | `notes` | Yes | Explanatory note or `null` | --- ## Common Issues to Avoid 1. **Adding inline links**: Do NOT add inline word links (⟦...⟧) when creating or revising examples. Links are added in a separate polishing step via `prompts/polish_add_inline_links.md`. 2. **Missing furigana**: `お金が余った` instead of `お{金|かね}が{余|あま}った` 3. **Overly literal translations**: "As for me, I eat sushi" instead of "I eat sushi" 4. **Wrong sense_numbers**: Example linked to wrong definition 5. **Unnatural constructions**: Grammatically correct but rarely used patterns 6. **Vocabulary tier violations**: Using general-tier words in basic-tier examples 1-2 7. **Insufficient examples**: Not meeting minimum counts per sense 8. **No length progression**: All examples being similar length --- ## Polishing Checklist for Examples When reviewing or revising examples, verify: ### Count and Distribution - [ ] Minimum examples per sense met (5 for basic/core, 3 for general) - [ ] All senses have adequate example coverage - [ ] Examples are properly distributed across senses ### Progressive Length - [ ] Examples progress from shorter to longer within each sense - [ ] Final examples are substantially longer or multi-sentence ### Vocabulary Restrictions - [ ] Basic tier: Examples 1-2 use only basic vocabulary - [ ] Basic tier: Examples 3-5 use only basic+core vocabulary - [ ] Core tier: Examples 1-2 use only basic+core vocabulary - [ ] No unlisted vocabulary in restricted examples ### Quality - [ ] All examples sound natural - [ ] All kanji have complete furigana - [ ] Translations are accurate and natural - [ ] sense_numbers are correct for each example - [ ] At least one example shows a common collocation - [ ] Grammar patterns are demonstrated clearly ### Format - [ ] IDs follow `{entry_id}_ex{N}` pattern - [ ] IDs are sequential (ex1, ex2, ex3...) - [ ] All required fields present - [ ] `has_audio` is accurate - [ ] `notes` field present (null if not needed) --- ## Adding Examples to Existing Entries When adding examples to bring an entry into compliance: 1. **Read the existing entry** to understand current examples 2. **Identify gaps**: Which senses need more examples? What lengths are missing? 3. **Check vocabulary tier**: Determine what vocabulary can be used 4. **Write new examples** following progressive length guidelines 5. **Verify compliance** with all requirements 6. **Update `modified` timestamp** in metadata ### Example Addition Template For a basic-tier entry with one sense needing 5 examples: ``` Example 1: [short, basic vocab only] Example 2: [short-medium, basic vocab only] Example 3: [medium, basic+core vocab] Example 4: [medium-long, basic+core vocab] Example 5: [long or multi-sentence, basic+core vocab] ``` --- ## Summary Table | Tier | Min Examples/Sense | Ex 1-2 Vocab | Ex 3+ Vocab | |------|-------------------|--------------|-------------| | Basic | 5 | Basic only | Basic + Core | | Core | 5 | Basic + Core | Any (prefer dictionary words) | | General | 3 | Any | Any (prefer dictionary words) |