Python Data Engineer

Altoida, Inc.

Altoida, Inc.

Software Engineering, Data Science
Boston, MA, USA · Boston, MA, USA · Massachusetts, USA · United States
Posted on Tuesday, January 9, 2024
Altoida is a pioneer in developing digital biomarkers of neurological disease using augmented reality and machine learning. Our technology platform is designed to enable an objective evaluation of an individual’s neurological health, allowing for more accurate patient selection and stratification for clinical trials, as well as sensitive monitoring of disease progression. Altoida’s mission is to provide life sciences companies with actionable insights using real-world data to increase the success rate for novel therapeutics, and usher in a new era of precision neurology using digital biomarkers. Our proprietary evidence-based platform is founded on more than 20 years of scientific research and published in multiple peer-reviewed papers including Nature Digital Medicine. For more information, visit . Follow us on Twitter @altoida.

Altoida’s culture is united around six core principles:

  • Patient-First
  • Open Communication
  • Collaborative
  • Reliable
  • Embrace Positive Change
  • Think and Build at Scale

About The Role

We are looking for a Data Engineer with Python expertise to develop and implement: ETL pipelines, data quality reporting, and pipeline monitoring resources for data ingestion and delivery of our proprietary data sets and machine learning outputs. The candidate will work closely with our ML engineers and data scientists to help scale the AI/ML modeling workflow so we can continue to meet the needs of our customers. The role encompasses end-to-end work such that the ideal candidate will have experience following raw data from ingestion all the way through to a completed feature data set prior to model scoring.


  • Design and Implementation: Lead the design, construction, and optimization of systems for data collection, data transformation, and quality verification for Altoida’s proprietary data sets and metadata.
  • Data Pipeline Management: Create and maintain data pipelines used by both data scientists and business analysts.
  • Collaboration and Interpretation: Work closely with our internal and external SME to troubleshoot issues, as well as with our data scientists and analysts to efficiently utilize our curated data, and translate complex datasets into actionable insights.
  • Data Visualization infrastructure: Assist with thedevelopment and maintenance of our reporting infrastructure (in terms of data pipelines that feed our dashboards, reports, and custom visualizations, etc).
  • Continuous Improvement: Proactively identify and implement enhancements in data processing efficiency, integrity, and quality with a sharp eye towards elevating the standards of our data handling practices.
  • Performance optimization: Contribute to the betterment of Altoida’s data ecosystem and SOPs by helping to improve resource management practices (e.g., within AWS), as well as data storage & retrieval processes across various environments (ie., development, staging, production).
  • Documentation: Thoroughly document data pipeline processes and systems for internal references, as well as for regulatory requirements (i.e., FDA).

Skills & Experience

  • Graduate degree in Computer Science, Statistics, Informatics, Information Systems or another quantitative field.
  • Five or more years of experience in a Data Engineer role.
  • Experience using the following software / tools:
  • Data pipeline/workflow management tools (e.g., Airflow, Luigi).
  • Container orchestration (Docker, Kubernetes).
  • AWS cloud services (EC2, EMR, RDS).
  • Big Data technologies (Spark, Kafka).
  • Database management (SQL, NoSQL, Postgres).
  • Python programming for object-oriented/function scripting.
  • Experience building and optimizing data pipelines, architectures and data sets.
  • Experience supporting and working with cross-functional teams in a fast-paced, dynamic environment.
  • Experience performing root cause analysis on data and related processes to identify improvement opportunities.
  • Excellent communication skills (written, oral, and visual), especially for explaining issues to those with limited expertise in technical fields
  • Experience with healthcare data (e.g., clinical trials data, claims, ehr/emr systems, etc.) is a plus


Altoida provides competitive and comprehensive compensation and benefits programs. Specific benefit offerings may vary by location, position, and/or business unit. The full-time salary range is commensurate with experience.


Altoida’s US headquarters is located in Washington, DC. The position is remote.

Equal Opportunity

Altoida does not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics. Altoida is committed to providing reasonable accommodations for candidates with disabilities in our recruiting process.