We are seeking a Data Scientist or Data Engineer with strong expertise in PostgreSQL, ETL, Machine Learning, and Data Engineering for an opportunity based in Woodlawn, MD. Please review the updated requirements provided below and share qualified candidates at your earliest convenience. Thank you in advance for your prompt attention and support. Program Integrity Analytics Support, Advanced Modeling and Analytics Facility (AMAF) Work Focus : i. Completing the code design, data validation, and data architecture tasks needed in AMAF to hand the analytic processes off to the systems team ii. Assisting with alert and label sampling to ensure accuracy iii. Making changes as necessary in AMAF as the systems team migrates the algorithms through staging, testing, and into production iv. Producing process documentation for analysts. Data Scientist i. Skills and Expertise Experience using SAS, SQL, Python, and / or R to compile, transform, and export data. Experience integrating federal social insurance claims, payment, and other administrative data to produce accurate and detailed data tables, statistical analyses, and visualizations related to program integrity studies. Experience writing technical and detailed accompanying documentation for data analyses, datasets, and data visualizations that promotes an in-depth understanding of an analysis and serves as both an institutional knowledge and quality assurance tool. Experience anticipating problems and working to mitigate any anticipated cost, schedule, quality, or timeliness impacts to projects. Experience with building data pipelines, designing databases, and optimizing data flows, preferably with Greenplum. Knowledge of best practices in transitioning prototypes to production (e.g., error-handling, machine learning operations, etc.). Program Integrity Fraud Waste and Abuse Support Work Focus i. Project support, business analysis, data analysis, data engineering and data visualization required for the implementation and tracking of new fraud controls ii. The implementation of new services to detect and prevent fraud, waste and abuse. iii. Support the analytics processes to develop and test fraud models and conduct research and identify emerging threats Data Scientist i. Skills and Expertise Experience using SAS, SQL, Python, and / or R to compile, transform, and export data. Experience in Python, R, TensorFlow, Scikit-learn Experience conducting studies of fraud schemes. Experience evaluating federal social insurance program integrity activities, including researching relevant federal policy, programs, and administrative procedures; conducting data investigations to identify reliable data sources and elements; and producing detailed reports of analyses and findings. Experience integrating federal social insurance claims, payment, and other administrative data to produce accurate and detailed data tables, statistical analyses, and visualizations related to program integrity studies. Expertise in converting data analyses into interactive visualizations to showcase complex relationships, facilitate the exploration of data, and guide users through the information. Experience writing technical and detailed accompanying documentation for data analyses, datasets, and data visualizations that promotes an in-depth understanding of an analysis and serves as both an institutional knowledge and quality assurance tool. Experience anticipating problems and working to mitigate any anticipated cost, schedule, quality, or timeliness impacts to projects. Develop and deploy machine learning models Perform statistical analysis and predictive modeling Work with large datasets to uncover trends and patterns Communicate findings through visualizations and reports Collaborate with data engineers and analysts to refine data processing pipelines Data Engineer i. Skills and Expertise Experience in SQL, Spark, Hadoop, Kafka Develop and manage data pipelines and ETL (Extract, Transform, Load) processes Optimize database performance and data storage solutions Ensure data integrity, security, and governance Work with cloud platforms (AWS, Azure, GCP) for data warehousing Collaborate with Data Scientists and Analysts to ensure clean and structured data Candidates must have experience in the following areas Participation in daily stand-ups, sprint planning, and agile development activities. Documentation, training materials, and knowledge transfer to SSA staff. Maintain clear and open communication with all stakeholders throughout the transition process and ongoing performance of work. Provide regular updates on progress and address any concerns or issues promptly. Weekly written status reports summarizing tasks, progress, risks, and changes. Training documentation and job aids for new workflows and applications. Must have a minimum of 10 years of recent relevant experience as a data scientist or engineer dealing with big data and statistical models Must have a relevant master's degree Must be able to pass public trust suitability
Data Scientist • New York, New York, United States