-
-
Notifications
You must be signed in to change notification settings - Fork 583
Add FuzzySharp-based text analysis plugin for domain-specific typo detection and entity extraction #1208
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: master
Are you sure you want to change the base?
Conversation
PR Compliance Guide 🔍Below is a summary of compliance checks for this PR:
Compliance status legend🟢 - Fully Compliant🟡 - Partial Compliant 🔴 - Not Compliant ⚪ - Requires Further Human Verification 🏷️ - Compliance label |
||||||||||||||||||||||||||
PR Code Suggestions ✨Explore these optional code suggestions:
|
|||||||||||||||||
|
Please remove any business related documents. |
User description
How to Test Locally
1. Copy Data Source
Copy the folder

/BotSharp/src/Plugins/BotSharp.Plugin.FuzzySharp/data/fuzzySharpunder/BotSharp/src/WebStarter/bin/Debug/net8.0/data/plugins2. Start the Application
Run the WebStarter project.
3. Run Tests via Postman
PR Type
Enhancement
Description
Add FuzzySharp plugin for domain-specific text analysis
Implement text tokenization and vocabulary loading
Create REST API endpoint for text analysis
Integrate plugin into BotSharp architecture
Diagram Walkthrough
File Walkthrough
21 files
Text analysis request model with parametersN-gram processing interface definitionResult deduplication and sorting interfaceMain text analysis service interfaceToken matching interface and context modelsVocabulary and domain term loading interfaceFlagged item model with match metadataText analysis response model structureMatch type constants for resultsText tokenization separator charactersREST API endpoint for text analysisPlugin registration and dependency injectionDomain term mapping matcher implementationExact vocabulary match implementationFuzzy matching for typo correctionN-gram processing with priority-based matchingResult deduplication and sorting logicMain text analysis service implementationCSV vocabulary and domain term loadingGlobal using statements for pluginText preprocessing and tokenization utilities4 files
Add FuzzySharp plugin to solutionFuzzySharp plugin project fileAdd FuzzySharp plugin referenceRegister FuzzySharp in plugin list1 files
Add CsvHelper and FuzzySharp dependencies