Cercle
Overview
Position: ML/AI Engineer
Location: Remote (U.S. preferred, with hubs in SF, NYC, Chicago, and Austin)
Cercle Overview & Mission
Cercle is an AI technology company focused on advancing healthcare for all women. Our AI platform and tools work together to unlock insights for women's healthcare companies. Current and future customers range from clinics and labs to hospitals and pharmaceutical companies. Engineers at Cercle work at the forefront of AI, ML, data engineering, and women’s healthcare. We're solving one of AI's most audacious challenges: building autonomous AI agents that can navigate and reason through complex enterprise data. This isn't just another LLM wrapper - we're pioneering graph-based AI architectures that transform how healthcare organizations leverage their data at scale.
Key Responsibilities
Architect and ship production-grade AI agents that (a) process billions of healthcare data points with sub-second latency and (b) solve ETL problems in new ways
Build cutting-edge RAG systems combining vector databases, knowledge graphs, and state-of-the-art embeddings
Optimize model performance and manage the server infrastructure for real-time deployment of large-scale embedding, graph, and vector database systems
Implement MLOps best practices for versioning, testing, deployment, and monitoring
Push the boundaries of what's possible with LLMs through advanced fine-tuning, prompt engineering, and novel architectures
Design self-healing ML pipelines that scale horizontally across distributed infrastructure
Own the full ML lifecycle from research to production, impacting millions of healthcare decisions
Tech Stack Proficiency Requirements
Python, PyTorch/TensorFlow, Kubernetes, Neo4j, Pinecone/Weaviate, Apache Spark, Ray, MLflow, Docker, AWS/GCP
Qualifications
5+ years shipping ML systems at scale (not just notebooks - real production systems)
Deep expertise in transformer architectures, embeddings, and modern NLP
Track record of building distributed systems that don't break at 3am
Fluency in Python and at least one systems language (Go/Rust/C++)
Experience with vector DBs, graph databases, or both
Comfort working with ambiguity and defining technical direction
Experience with MLOps tools (MLflow, Kubeflow, SageMaker, Vertex AI, or similar)
Hands-on experience with LLMs (e.g., fine-tuning, prompt engineering, embeddings)
Bonus Points
Experience integrating ML models into production applications
Healthcare tech experience (FHIR, HL7, HIPAA compliance)
Experience with heterogeneous data sets
Contributions to open source projects
Experience with graph databases or visualization libraries
You've built something that handled 10k+ RPS
Benefits
Competitive base salary + meaningful stock options
Highly flexible remote work with a global team, and annual company retreats
Medical, dental, vision, and 401(k) plans
Unlimited PTO in addition to U.S. federal holidays
The chance to shape cutting-edge AI products from the ground up
If interested, please send your resume to recruiting@cercle.ai