FastTek Global
Dearborn, Michigan
Platform Engineer #1043620
Employees in this job function are focused on developing and maintaining reusable software components that serve the needs of product developers in the organization. They are responsible for designing, implementing, integrating and maintaining the underlying infrastructure and software applications that support developer productivity and self‑service.
Key Responsibilities
Collaborate with enterprise architects, software architects, software engineering teams, etc. to design the platform infrastructure and tools encompassing servers, networks, storage, databases, cloud services, etc.
Implement and manage the infrastructure that supports the platform tools, and ensure that upgrades, security patches and other performance improvements are regularly performed.
Evaluate cloud providers, containerization solutions and other complex technologies to deeply understand the configurations available and create abstractions of common configurations that can be utilized easily by application teams for their workloads.
Write and execute automated Infrastructure as Code scripts and utilize processes like CI/CD to streamline and automate how the platform infrastructure is provisioned, configured and managed to improve consistency, traceability and repeatability.
Integrate performance and monitoring best practices, including QoS and SLA metrics, to scale platform applications and managed services automatically in response to demand.
Incorporate security and disaster recovery best practices into the infrastructure applications by integrating access control, identity management, logging/monitoring, public/private network configurations, data encryption, storage backup and disaster recovery, etc.
Facilitate the integration of enterprise managed software configurations into deployment pipelines managed by application teams to ensure approved configurations and best practices around security, networking, logging & monitoring, performance & scale, etc. are applied.
Exchange feedback with service providers and developers to ensure the platform continues to grow and evolve to meet their needs.
Skills Required
Scripting
Automation
Root Cause Analysis
Troubleshooting (Problem Solving)
Cloud Architecture
IT Solutions
GitHub
Cloud Infrastructure
Change Management
Technical Analysis
Developer
Tekton
Utilization Management
Kubernetes
Skills Preferred
Ansible
GCP
Dynatrace
PowerShell
Access Controls
Python
Information Security
VMware
Experience Required
Engineer experience: practical in 2 coding languages or advanced in 1 language
6+ years in IT
4+ years in development
Education Required
Associate Degree
College Senior
Education Preferred
Certification Program
Bachelor's Degree
Additional Information
Conduct capacity planning and forecasting for the OpenShift Virtualization (OSV) platform, including compute, memory, storage, and network resources, to ensure scalability and prevent resource exhaustion.
Analyze resource utilization trends and make recommendations for infrastructure scaling, consolidation, or optimization.
Collaborate with application teams and stakeholders to understand future demand and project capacity needs.
Develop and maintain capacity models and reports to support strategic planning.
Develop automation solutions (scripts, playbooks) for repetitive OSV tasks, including configuration changes, VM management, auditing, remediation and integration with ticketing systems.
Leverage automation to deliver operator updates and changes efficiently at scale.
Implement Site Reliability Engineering (SRE) principles and practices to improve overall platform stability, performance, and operational efficiency.
Role Based Access Control deployment and auditing.
Namespace and Resource Quota management.
Implement and maintain comprehensive end‑to‑end observability solutions (monitoring, logging, tracing) for the OSV environment, including integration with tools like Dynatrace and Prometheus/Grafana.
Explore and implement Event Driven Architecture (EDA) for enhanced real‑time monitoring and response.
Develop capabilities to flag and report abnormalities and identify "blind spots" in observability.
Perform deep dive Root Cause Analysis (RCA), potentially utilizing available tooling, to quickly identify and resolve issues across the global compute environment.
Pinpoint unhealthy components anywhere in the global compute environment (the "needle in a haystack") to shorten time to resolution.
Monitor VM health, resource usage, and performance metrics proactively.
Monitor for unusual activity that might indicate a compromise or misconfiguration.
Solution Design & Consulting.
Knowledge Management.
Benefits
Medical and Dental (FastTek pays majority of the medical program)
Vision
Personal Time Off (PTO) Program
Long Term Disability (100% paid)
Life Insurance (100% paid)
401(k) with immediate vesting and 3% (of salary) dollar‑for‑dollar match
AI & Hiring Disclosure
We use AI tools to support parts of our hiring process, such as reviewing applications and identifying potential matches. These tools are designed to promote efficiency, consistency, and fairness, and they are always used under human oversight.
All personal data collected is used solely for recruitment purposes, and you have the right to know, access, or request deletion of your data at any time, subject to legal limits.
If AI will be used in a video interview, you'll be informed in advance and asked for your consent, with the option to opt out.
Our tools are regularly reviewed to detect potential bias and to ensure compliance with all applicable laws and our commitment to inclusive hiring.
To learn more or exercise your rights, please contact us at info@fasttek.com.