Jobs via Dice
Kforce has an enterprise client seeking a
Systems Engineer IV
in Nashville, TN. RESPONSIBILITIES: Drive technical innovation and efficiency in infrastructure operations via automation Design server monitoring and management solutions using automation and self-repair Create processes that enhance operational workflow and provide positive customer impact Dive deep to resolve problems at their root, looking for failure patterns amenable to long-term solutions via simplification and automation Avoid re-inventing the wheel and prefer appropriately simple, repeatable solutions over more complex and failure prone ones Recognize and adopt best practices in documentation, testing, security, operational support at scale, and efficient use of resources Develop appropriate metrics to demonstrate performance at improving operational efficiency REQUIREMENTS: 5+ years of experience in AI/ML development 3+ years of experience implementing ML/DL algorithms in Python (PyTorch, Keras, scikit-learn) 2+ years of experience building Generative AI applications (LLM-driven solutions in production) 1+ years of experience deploying production AI agents at scale Experience with PyTorch for advanced research and model customization Experience with Keras for rapid prototyping and TensorFlow integration Strong background in AWS cloud-native solutions for ML/AI (SageMaker, Bedrock, Lambda, ECS, EKS) Experience with industrial systems integration and protocols (OPC-UA, Modbus, MQTT, REST APIs) Understand how commodity servers, operating systems and networks function, perform and scale Possess superb troubleshooting, project management and problem analysis skills Technical Environment: Programming & Scripting: Python (primary), Bash, SQL ML/AI Frameworks: PyTorch, TensorFlow, Keras, scikit-learn Agent Frameworks: LangChain, AutoGPT, CrewAI AWS Services: Compute: EC2, Lambda, ECS/EKS Data: S3, Glue, Athena, Redshift, DynamoDB, Timestream AI/ML: SageMaker, Bedrock, Kendra, OpenSearch Vector DB Messaging/Streaming: Kinesis, SQS/SNS, EventBridge Infra & Security: IAM, VPC, CloudFormation/CDK, AWS-SDK, CloudWatch, Step Functions Additional: Databases: Redshift, PostgreSQL, MySQL, DynamoDB, Timestream Visualization: Matplotlib, Plotly, Grafana, QuickSight CI/CD & DevOps: GitHub/GitLab CI, Docker, Terraform/CDK Industrial/Edge: OPC-UA, MQTT, REST APIs for IoT/industrial data Kforce is an
Equal Opportunity/Affirmative Action Employer . All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, pregnancy, sexual orientation, gender identity, national origin, age, protected veteran status, or disability status. We offer comprehensive benefits including medical/dental/vision insurance, HSA, FSA, 401(k), and life, disability & ADD insurance to eligible employees.
#J-18808-Ljbffr
Systems Engineer IV
in Nashville, TN. RESPONSIBILITIES: Drive technical innovation and efficiency in infrastructure operations via automation Design server monitoring and management solutions using automation and self-repair Create processes that enhance operational workflow and provide positive customer impact Dive deep to resolve problems at their root, looking for failure patterns amenable to long-term solutions via simplification and automation Avoid re-inventing the wheel and prefer appropriately simple, repeatable solutions over more complex and failure prone ones Recognize and adopt best practices in documentation, testing, security, operational support at scale, and efficient use of resources Develop appropriate metrics to demonstrate performance at improving operational efficiency REQUIREMENTS: 5+ years of experience in AI/ML development 3+ years of experience implementing ML/DL algorithms in Python (PyTorch, Keras, scikit-learn) 2+ years of experience building Generative AI applications (LLM-driven solutions in production) 1+ years of experience deploying production AI agents at scale Experience with PyTorch for advanced research and model customization Experience with Keras for rapid prototyping and TensorFlow integration Strong background in AWS cloud-native solutions for ML/AI (SageMaker, Bedrock, Lambda, ECS, EKS) Experience with industrial systems integration and protocols (OPC-UA, Modbus, MQTT, REST APIs) Understand how commodity servers, operating systems and networks function, perform and scale Possess superb troubleshooting, project management and problem analysis skills Technical Environment: Programming & Scripting: Python (primary), Bash, SQL ML/AI Frameworks: PyTorch, TensorFlow, Keras, scikit-learn Agent Frameworks: LangChain, AutoGPT, CrewAI AWS Services: Compute: EC2, Lambda, ECS/EKS Data: S3, Glue, Athena, Redshift, DynamoDB, Timestream AI/ML: SageMaker, Bedrock, Kendra, OpenSearch Vector DB Messaging/Streaming: Kinesis, SQS/SNS, EventBridge Infra & Security: IAM, VPC, CloudFormation/CDK, AWS-SDK, CloudWatch, Step Functions Additional: Databases: Redshift, PostgreSQL, MySQL, DynamoDB, Timestream Visualization: Matplotlib, Plotly, Grafana, QuickSight CI/CD & DevOps: GitHub/GitLab CI, Docker, Terraform/CDK Industrial/Edge: OPC-UA, MQTT, REST APIs for IoT/industrial data Kforce is an
Equal Opportunity/Affirmative Action Employer . All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, pregnancy, sexual orientation, gender identity, national origin, age, protected veteran status, or disability status. We offer comprehensive benefits including medical/dental/vision insurance, HSA, FSA, 401(k), and life, disability & ADD insurance to eligible employees.
#J-18808-Ljbffr