Chatbot Evaluation through Q&A Test Sets
Join our team to help advance the evaluation of a cutting-edge data analysis chatbot. This is a hands-on opportunity to work in a fast-paced environment, contributing to a project that spans multiple industries and enhances our chatbot's ability to provide accurate, relevant insights across diverse data domains. In this role, you’ll develop a collection of Question & Answer (Q&A) sets designed to test and benchmark the performance of our AI-driven chatbot. You’ll work directly with structured datasets across specific industries to create targeted Q&A sets, helping us improve and refine our chatbot’s data analysis capabilities. Who Should Apply: Experience using ChatGPT, Llama, and/or other LLMs. Students in fields such as Data Science, Computer Science, Business Analytics, or similar. Strong analytical and problem-solving skills, with an interest in data science and machine learning. Experience with large datasets and familiarity with data visualization and trend analysis. Working experience in industries relating to healthcare, environment, marketing, and others.