Currently creatiin chatbot contans lot’s of manual steps that can be automated (data scraper) https://github.com/asya-ai/delloite-data-scraper/ Another scraper for web pages (these are based on puppeteer, but might use selenium if problem with coding in TypeScript) https://github.com/asya-ai/chatbot-knowledge-scraper
Chatbot itself: https://github.com/asya-ai/delloite-chatbot
^ Both github HenriARM
access given
Data scraper produces JSON which is then processed by Chatbot code - embeddings are not stored and are re-computed for each session, lot’s of optimizations and code-rewrite needed, but this was Proof of Concept.
Later designer will make design, but this is what we propose as a product version
Make code review note and UML proposal for database structure
Make flow chart based on existing code and methodolgy for automatically producing dataset
FYI tags for facts are missing, but will improve conversations
Make flow chart based on existing code and methodolgy for answering questions using existing dataset
FYI content intent classifiers are missing, but will improve conversations (we already have them as APIs)
There will be Python Flask based frontend/backend, worker for processing datasets and multiple language model APIs (we will try to avoid OpenAI and alike as much as possiblke and optimize our own models for speed and accuracy)
Focus markets not English speaking - for example Latvian (we already have translation APIs)