# Assessment: Orchestration & Prompts for MCP Content Mining

## Context & Requirements Synthesis

### User Priorities Identified
1. **Two Primary Mining Modes**:
   - **Greenfield content mining** - Starting from scratch with new source material
   - **Diff content mining** - Analyzing changes/updates to existing content
2. **Rough ELO estimation** - The platform handles real-time convergence; agents provide sane initial sorting
3. **Leverage existing patterns** from `@packages/edit-ui/src/components/BulkImportView.vue`
4. **Atomic tool composition** - Agents orchestrate using create_card, update_card, etc.
5. **Flexible guidance** - Prompts suggest strategies without rigid constraints

### Existing Bulk Import Knowledge
From `/agent/mcp/prompt-bulk.md`, the platform already has:
- **Detailed fillIn formatting rules** - `{{answer}}`, `{{multiple|options}}`, `{{correct||distractor1|distractor2}}`
- **ELO calibration guidelines** - 100-500 beginner, 500-1000 early, 1000-1500 intermediate, etc.
- **Tag formatting conventions** - no spaces, dots/hyphens allowed, comma-separated
- **Card separation syntax** - double `---` lines
- **Markdown support** - code blocks, formatting within questions and answers
- **Content best practices** - concise answers, clear questions, code examples (see the sample cards below)
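
To make the format concrete, a hedged sketch of two cards (the question content here is invented, and tag/ELO metadata lines are omitted; `/agent/mcp/prompt-bulk.md` remains the authoritative reference for the full format):

````
Vue components receive external data through {{props}}.

---
---

```python
nums = [x * x for x in range(5)]
```

This Python construct is called a {{list comprehension}}.
````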

## MCP Prompts vs Tools Analysis

### Why Prompts Are Superior for Content Mining

**MCP Prompts provide:**
- ✅ **Expert guidance** without execution constraints
- ✅ **Parameter customization** for different contexts
- ✅ **Agent autonomy** - can adapt, combine, or ignore advice
- ✅ **Composability** - work with the agent's own intelligence
- ✅ **Maintainability** - easy to update expertise without code changes

**Compared to orchestrator tools, which would:**
- ❌ **Lock in rigid workflows** that may not fit all scenarios
- ❌ **Hide decision-making** from agent reasoning
- ❌ **Create debugging complexity** with nested tool calls
- ❌ **Limit creativity** with predefined patterns

### Agent Decision Process for Prompts

**Agent prompt selection workflow** (sketched in code below):
1. **User request analysis** - "Generate quizzes from this Go codebase"
2. **Available prompts discovery** - Lists MCP prompts via the protocol
3. **Context matching** - Identifies this as a greenfield mining scenario
4. **Prompt invocation** - Calls `greenfield_content_mining` with repo context
5. **Guidance integration** - Synthesizes prompt advice with its own reasoning
6. **Tool orchestration** - Uses create_card and tag_card atomically, following the strategy
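
A rough sketch of steps 2-4 from the agent side, using the MCP TypeScript SDK client (method names follow the current SDK, but treat this as an assumption-laden sketch rather than working integration code):

```typescript
import { Client } from '@modelcontextprotocol/sdk/client/index.js';

// Assumes `client` is already connected to the Vue-Skuilder MCP server.
async function fetchMiningGuidance(client: Client) {
  // Step 2: discover available prompts via the protocol.
  const { prompts } = await client.listPrompts();

  // Step 3: a fresh "generate quizzes from this Go codebase" request maps
  // to the greenfield prompt rather than the diff one.
  if (!prompts.some((p) => p.name === 'greenfield_content_mining')) {
    throw new Error('greenfield_content_mining prompt not registered');
  }

  // Step 4: invoke it with repo context; the returned messages are guidance
  // the agent folds into its own reasoning before calling create_card / tag_card.
  return client.getPrompt({
    name: 'greenfield_content_mining',
    arguments: {
      sourceType: 'codebase',
      domain: 'programming',
      targetAudience: 'intermediate',
      scopeConstraint: 'full-project',
    },
  });
}
```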

## Proposed MCP Prompt Architecture

### Core Prompt Templates (4 Essential)

#### 1. `greenfield_content_mining`
**Purpose:** Guide systematic analysis of new source material for comprehensive courseware creation

**Parameters:**
- `sourceType` - "codebase", "documentation", "tutorials", "papers"
- `domain` - "programming", "math", "science", etc.
- `targetAudience` - "beginner", "intermediate", "advanced"
- `scopeConstraint` - "single-file", "module", "full-project"

**Template provides:**
- **Repository structure analysis** - How to identify key concepts vs implementation details
- **Content prioritization** - Core concepts first, edge cases later
- **Knowledge dependency mapping** - Prerequisites and logical sequencing
- **Coverage strategies** - Breadth vs depth decisions
- **ELO estimation guidance** - Initial difficulty assessment based on concept complexity
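
The prompt itself stays strategic, but it can nudge the agent to externalize a plan before touching any tools. A hypothetical sketch of such a plan's shape (these names are illustrative, not platform APIs):

```typescript
// Hypothetical shape only - not a platform API. One way an agent might
// structure its greenfield plan before any create_card calls.
interface MiningPlanEntry {
  concept: string;            // e.g. "goroutine basics"
  prerequisites: string[];    // concepts that should be carded first
  priority: 'core' | 'supporting' | 'edge-case';
  estimatedElo: number;       // rough initial band; the platform refines it later
}

const plan: MiningPlanEntry[] = [
  { concept: 'goroutine basics', prerequisites: [], priority: 'core', estimatedElo: 800 },
  { concept: 'channel select', prerequisites: ['goroutine basics'], priority: 'supporting', estimatedElo: 1400 },
];
```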

#### 2. `diff_content_mining`
**Purpose:** Guide analysis of changes/updates to identify new quiz opportunities

**Parameters:**
- `changeType` - "feature-addition", "refactor", "bug-fix", "docs-update"
- `diffScope` - "lines-changed", "files-affected", "concepts-modified"
- `existingCoverage` - "sparse", "moderate", "comprehensive"

**Template provides:**
- **Change impact analysis** - What concepts are newly introduced vs modified
- **Gap identification** - What existing cards need updates vs new creation
- **Incremental strategies** - How to build on existing content
- **Deprecation handling** - Managing outdated concepts
- **Version tracking** - Linking content to specific code versions
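
As a purely illustrative example of the kind of decision framework the template could teach (not platform behavior), an agent might reduce the create-vs-update choice to something like:

```typescript
// Illustrative heuristic only: choosing an action per concept touched by a diff.
type ChangeType = 'feature-addition' | 'refactor' | 'bug-fix' | 'docs-update';
type Coverage = 'sparse' | 'moderate' | 'comprehensive';

function diffAction(change: ChangeType, coverage: Coverage): 'create' | 'update' | 'review' {
  // New features with thin existing coverage call for fresh cards.
  if (change === 'feature-addition' && coverage !== 'comprehensive') return 'create';
  // Refactors and bug fixes mostly invalidate details in existing cards.
  if (change === 'refactor' || change === 'bug-fix') return 'update';
  // Everything else is worth a review pass before creating anything.
  return 'review';
}
```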

#### 3. `fillIn_question_crafting`
**Purpose:** Guide creation of effective fill-in-the-blank questions using Vue-Skuilder syntax

**Parameters:**
- `contentType` - "code", "theory", "api-usage", "troubleshooting"
- `cognitiveLevel` - "recall", "application", "analysis", "synthesis"
- `answerComplexity` - "single-word", "short-phrase", "code-snippet"

**Template provides:**
- **Format patterns** from bulk import experience:
  - `{{simple_answer}}` for direct recall
  - `{{option1|option2}}` for multiple acceptable answers
  - `{{correct||distractor1|distractor2}}` for multiple choice
- **Code integration best practices** - Balancing context with focused questions
- **Answer entropy management** - Keeping answers matchable while meaningful
- **Distractor selection** - Common mistakes and plausible alternatives
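
Hedged one-liners showing each pattern in use (question content invented for illustration):

```
Recall: Go's {{defer}} statement delays a call until the surrounding function returns.

---
---

Multiple acceptable answers: a Go slice grows via the built-in {{append|append()}} function.

---
---

Multiple choice: `go vet` performs {{static analysis||code formatting|compilation|benchmarking}}.
```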

#### 4. `elo_rough_calibration`
**Purpose:** Guide initial ELO estimation, knowing real-world performance data will refine it

**Parameters:**
- `conceptComplexity` - "fundamental", "applied", "nuanced", "edge-case"
- `prerequisiteDepth` - "none", "basic", "substantial", "advanced"
- `cognitiveLoad` - "recall", "comprehension", "application", "analysis"

**Template provides:**
- **Initial ELO bands** refined from bulk import guidelines:
  - 100-500: Basic vocabulary, simple facts
  - 500-1000: Applied knowledge, straightforward procedures
  - 1000-1500: Integration concepts, moderate problem-solving
  - 1500-2000: Complex applications, multi-step reasoning
  - 2000+: Expert insights, edge cases, advanced synthesis
- **Comparative benchmarking** - How to assess relative to existing content
- **Uncertainty handling** - When to err conservative vs aggressive
- **Refinement expectations** - Understanding the platform will auto-adjust
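
A minimal sketch of how the three parameters could combine into an initial estimate, assuming simple additive weights (the numbers are invented starting points, not the platform's convergence algorithm):

```typescript
// Rough additive heuristic for an initial ELO guess; real-world learner
// performance data converges the true value afterwards.
const complexityBase = { fundamental: 300, applied: 800, nuanced: 1300, 'edge-case': 1800 } as const;
const prereqBonus = { none: 0, basic: 100, substantial: 250, advanced: 400 } as const;
const loadBonus = { recall: 0, comprehension: 100, application: 200, analysis: 350 } as const;

function roughElo(
  complexity: keyof typeof complexityBase,
  prereqs: keyof typeof prereqBonus,
  load: keyof typeof loadBonus
): number {
  return complexityBase[complexity] + prereqBonus[prereqs] + loadBonus[load];
}

// roughElo('applied', 'basic', 'application') === 1100 - squarely in the intermediate band
```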

## Prompt Implementation Strategy

### MCP Technical Integration

**Prompt registration pattern:**
```typescript
this.mcpServer.registerPrompt(
  'greenfield_content_mining',
  {
    title: 'Greenfield Content Mining Strategy',
    description: 'Systematic approach for mining new source material',
    arguments: [
      { name: 'sourceType', description: 'Type of source material being analyzed' },
      { name: 'domain', description: 'Subject domain of the content' },
      { name: 'targetAudience', description: 'Intended learning audience level' },
      { name: 'scopeConstraint', description: 'Scope of analysis (file, module, project)' }
    ]
  },
  async (args) => ({
    messages: [
      {
        role: 'user',
        content: { type: 'text', text: generateGreenfieldContentMiningPrompt(args) }
      }
    ]
  })
);
```
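
The `generateGreenfieldContentMiningPrompt` helper referenced above doesn't exist yet; a minimal sketch, assuming it simply interpolates the arguments into the strategy outline from this assessment:

```typescript
// Sketch only: fold prompt arguments into strategy-guidance text.
// The real template would carry the full methodology listed earlier.
interface GreenfieldArgs {
  sourceType?: string;
  domain?: string;
  targetAudience?: string;
  scopeConstraint?: string;
}

function generateGreenfieldContentMiningPrompt(args: GreenfieldArgs): string {
  return [
    `You are mining ${args.sourceType ?? 'source material'} in the ${args.domain ?? 'general'} domain`,
    `for ${args.targetAudience ?? 'mixed'}-level learners, scoped to ${args.scopeConstraint ?? 'the full project'}.`,
    '',
    'Strategy:',
    '1. Map the structure: separate key concepts from implementation detail.',
    '2. Prioritize core concepts; defer edge cases.',
    '3. Sequence by prerequisite dependencies.',
    '4. Assign a rough initial ELO per concept; the platform refines it from learner data.',
  ].join('\n');
}
```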

### Content Strategy

**Prompts should contain:**
- ✅ **Structured methodologies** - Step-by-step approaches
- ✅ **Decision frameworks** - When to choose different strategies
- ✅ **Quality criteria** - What makes good vs poor questions
- ✅ **Practical examples** - Concrete patterns for different content types
- ✅ **Vue-Skuilder specifics** - Leverage platform features and conventions

**Prompts should NOT contain:**
- ❌ **Rigid scripts** - Agents need flexibility
- ❌ **Implementation details** - Keep focused on strategy
- ❌ **Tool invocation** - Agents decide when to use create_card, etc.
- ❌ **Fixed sequences** - Allow for creative orchestration

## Organizational Benefits

### Separation of Concerns
- **Prompts**: Domain expertise and strategy guidance
- **Agent**: Intelligence, creativity, and orchestration
- **Tools**: Atomic operations (create, update, tag, delete)
- **Platform**: Real-time ELO convergence and performance tracking

### Maintainability
- **Prompt updates** don't require code changes
- **Strategy evolution** can be managed independently
- **Domain expertise** centralized and reusable
- **Testing simplified** - prompts testable in isolation

### Scalability
- **New domains** just need new prompt templates
- **Different agent types** can use the same prompt guidance
- **Cross-platform portability** via the MCP standard
- **Expertise sharing** across agent implementations

## Recommendation

Implement the 4 core prompt templates as Phase 2.2, replacing the complex orchestrator-tool approach. This strikes the right balance of expert guidance and agent autonomy, leveraging existing Vue-Skuilder patterns while maintaining full flexibility for creative content-mining strategies.

The agent can then choose appropriate prompts based on context, synthesize the guidance with its own reasoning, and execute via the atomic tools we've already implemented.