Job Description : Building standalone applications that interact with LLM modelsBuilding RAG-based applications Understanding of Vector DBs like Solr for LLM based applicationsIntegrate models with existing systems and APIs.Implement production quality ETL jobsPreprocess and manage data for training and deployment.Collaborate with cross-functional teams to define, design, and ship new features.Write clean, maintainable, and efficient code.Document development processes, code, and APIs.REQUIREMENTSProven experience in building customer facing ML based APIsExperience in developing applications that are scalable to handle TBs of dataStrong knowledge of API integration (RESTful, GraphQLExperience with data preprocessing, SQL, and NoSQL databases as well as vector stores (e.g., Postgres, MySQL, Solr, Elasticsearch / OpenSearch, etcFamiliarity with deployment tools (Docker, KubernetesExperience with DevOps tools like Jenkins, Terraform, Cloud Formation templates is highly preferred.Excellent problem-solving and communication skills.Experience with Spark / Hadoop, EMR or any other Big Data technology would be a plusAbility to work collaboratively in an agile team environment We are an equal opportunity employer. All aspects of employment including the decision to hire, promote, discipline, or discharge, will be based on merit, competence, performance, and business needs. We do not discriminate on the basis of race, color, religion, marital status, age, national origin, ancestry, physical or mental disability, medical condition, pregnancy, genetic information, gender, sexual orientation, gender identity or expression, national origin, citizenship / immigration status, veteran status, or any other status protected under federal, state, or local law.
Data Scientist • Austin, TX, United States of America