Digital Health Data Engineer - Remote Latam

Auto Import

<p><br><strong>Excellent opportunity to work REMOTELY with a U.S.-based company. Candidates living in Mexico, or all LATAM are welcome to apply.</strong><br><br><strong>About the Company</strong></p><p>Bydrec, Inc. is a California-based company that connects top Tech talent from Latin America with U.S. companies looking to expand their development teams. Learn more at <a href="https://bydrec.com/">bydrec.com</a>.<br> </p><p><strong>Position Summary</strong><br>We are seeking a <strong>Digital Health Data Engineer</strong> with expertise in extracting features and analyzing <strong>multimodal time-series data from biosensors</strong> such as accelerometer, ECG, PPG, and EEG. This role will also focus on developing <strong>advanced data pipelines for digital health applications</strong>, including <strong>audio and video data processing</strong>.</p><p>The ideal candidate has strong experience with <strong>Python, cloud-native architectures, AWS Batch, containerization, and time-series databases</strong>. This person will lead <strong>data exploration initiatives</strong>, drive <strong>technical innovation</strong>, and collaborate across teams to advance healthcare solutions through the development of <strong>digital biomarkers</strong>.</p><p><br><strong>Responsibilities</strong></p><ul><li><strong>Design, build, and maintain data pipelines</strong> to ensure seamless integration and high-performance processing of <strong>large-scale time-series datasets</strong>.</li><li><strong>Develop rapid QC metrics</strong> used in dashboards to present <strong>complex datasets to stakeholders</strong>.</li><li>Provide <strong>Python expertise</strong>, supporting team members with queries and troubleshooting while promoting <strong>best practices in code quality and development</strong>.</li><li><strong>Communicate insights and results</strong> through reports, presentations, and technical documentation.</li></ul><p><br><strong>Qualifications<br><br>Required:</strong></p><ul><li>Bachelor’s degree with <strong>5+ years of industry experience</strong>, or a Master’s degree with <strong>3+ years of experience</strong>, in <strong>Computer Science, Data Science, Bioinformatics, or a related quantitative field</strong>.</li><li><strong>Advanced English proficiency</strong> (written and spoken).</li><li>Strong proficiency in <strong>Python</strong>, with the ability to mentor and support the team in solving <strong>complex Python-related challenges</strong>. <strong>Proficiency in R</strong> is also required.</li><li>Experience in <strong>data visualization for complex and large-scale datasets</strong>, particularly <strong>time-series data</strong>, using tools such as <strong>Power BI</strong>.</li><li>Expertise in <strong>SQL, PySpark, and Ray clusters</strong> for data engineering and large-scale analysis.</li><li>Experience with <strong>containerization tools such as Docker</strong> and deploying data workflows in modern environments.</li><li>Hands-on experience with <strong>machine learning</strong>, particularly with <strong>large datasets</strong>.</li><li>Familiarity with <strong>cloud technologies</strong>, including <strong>AWS and Snowflake</strong>.</li><li>Experience working with <strong>multimodal time-series biosensor data</strong> such as <strong>accelerometer, ECG, PPG, or EEG</strong>.</li><li>Strong <strong>communication skills</strong>, with the ability to collaborate across teams and present complex technical concepts clearly.</li></ul><p><br><strong>Nice to Have:</strong></p><ul><li>Familiarity with <strong>large language models (LLMs)</strong> and applying <strong>generative AI approaches such as Retrieval-Augmented Generation (RAG)</strong> in <strong>digital health environments</strong>.</li><li>Knowledge of <strong>GPU computing, high-performance computing, and cloud-native architectures</strong>.</li><li>Experience with <strong>relational and cloud databases</strong>, including <strong>PostgreSQL and Redshift</strong>.</li><li>Hands-on experience managing and optimizing <strong>cloud infrastructure</strong> using <strong>AWS and Snowflake</strong>, including tools such as <strong>Kubernetes, AWS Bedrock, AWS Batch, Athena, AWS Glue, KIRO, and S3 integrations</strong>.</li><li>Familiarity with <strong>cardiovascular, neuroscience, or epidemiology datasets</strong>.</li><li>Experience with <strong>FDA submissions, validation processes, and working within GxP environments</strong>.</li></ul>

Back to blog