Argyll Infotech Enterprise Pvt Ltd
"Platform Engineer- Enterprise Monitoring"
Argyll Infotech Enterprise Pvt Ltd, Dallas, Texas, United States, 75215
1 day ago Be among the first 25 applicants
Get AI-powered advice on this job and more exclusive features.
Platform Engineers translate high level platform design into low level technical design and are responsible for implementing, administering, supporting and patching their corresponding platforms. Platform Engineers work closely with Solution Architects to enable the capabilities defined on roadmaps and blueprints supporting platform programs and initiatives. Platform Engineers are well versed in modern data, infrastructure and integration platforms, industry/technology best practices and actively work on improving the reliability and scalability of infrastructure.
The Enterprise Monitoring (EM) team Platform Engineer will be responsible for the design, deployment and management of robust and scalable Monitoring and Logging platforms for the Enterprise. This role involves the execution, maintenance and delivery of new features/capabilities across multiple monitoring and observability solutions, to ensure end-to-end visibility into the health, performance and availability of applications and platforms. We are looking for a highly motivated, technically savvy, business focused Platform Engineer who has the skills to work in a highly collaborative and fast-paced environment and is willing to adapt to multiple strategic priorities to deliver high quality deliverables.
This position will be filled onsite in Issaquah, WA or Dallas, TX.
Job Duties/Essential Functions
Designs, implements and maintains monitoring platforms across multi-cloud (Azure, GCP etc) and on-prem environments. Manages and optimizes observability solutions (such as Dynatrace, Prometheus, OpenTelemetry etc) for end-to-end
systems visibility.
Assesses technical components, translates high level design into low level technical design and executes updates based
on SLAs.
Integrates logging solutions and ensures proper ingestion, parsing and indexing of logs across platforms. Administers Enterprise Monitoring tools such as SCOM, IBM Tivoli Monitoring, HP Operations Agent, Network Node
Manager (NNMi) and Dynatrace to ensure continuous availability and performance.
Develops automation scripts (e.g: Powershell, Python etc) to streamline alerting, onboarding and configuration tasks. Manages the platform on an ongoing basis while performing typical run functions like monitoring, patching and support. Leads and conducts code reviews, design reviews, testing, and debugging activities at the application level. Conducts regular platform check ups and reports inefficiencies to relevant business and technology stakeholders. Develops and instruments monitoring dashboards depicting platform health and performance. Develops and builds scalable and generalized frameworks to support the integration of internal and third-party APIs. Collaborates with DevOps, SRE, Cloud Engineering and Application teams to define and implement SLIs, SLOs, alerts, reports and dashboards. Troubleshoots monitoring gaps and issues across platforms and provide root cause analysis and resolution guidance to
teams.
Maintains platform upgrades, patching and configurations in line with compliance and security requirements. Participates in the creation of documentation and artifacts used to describe the mechanisms used for deployment,
monitoring, maintenance and best practices for platform usage, configuration and alert tuning.
Supports incident response, resolution and post-incident reviews for Production issues to identify failure points or
performance degradation.
Performs coding tests to validate hardware design correctness, and creates software regression tests to ensure its
reliability.
Conducts research and makes recommendations on standards, products, and services. Tests diagnostics (including) automated regression testing. Develops and executes integration testing plans (as needed). Evaluates and documents all operating systems according to required standards. Participates in the development of continuous integration/continuous development frameworks in support of DevOps and
Agile practices.
Regular and reliable workplace attendance at your assigned location.
Ability to operate vehicles, equipment or machinery
Computer, fax, phone, copier, printer
Non-Essential Functions
Assists in other areas of the department as necessary. Assists in other areas of the company as necessary.
Ability to operate vehicles, equipment or machinery
Same as Essential Functions.
Required
Experience, Skills, Education & Licenses/Certifications
Experience with Cloud Platforms, Saas Products, On-Prem solutions, and end to end development and delivery roles and
responsibilities.
Hands-on experience with Enterprise Monitoring tools such as Dynatrace, Google Cloud Logging, SCOM, IBM Tivoli, HP
Operations Agent, Network Node Manager (NNMi) or equivalent.
Working knowledge of Observability frameworks such as OpenTelemetry, Grafana, Prometheus, or equivalent. Working knowledge of data visualization tools such as MS PowerBI, Google Looker, etc. Proficiency in IaC scripting using tools such as Terraform or equivalent. Proficiency with scripting languages such as MySQL, PowerShell, Python, or equivalent. Proficient with OS utilities. Holds certifications in Cloud technologies such as Azure, Google Cloud Platform, or equivalent. Excellent verbal and written communication skills. Foundational networking knowledge. Excellent analytical skills and ability to effectively troubleshoot and provide solutions. Ability to work both independently and within a close team environment. Scheduling flexibility to meet the needs of the business, including weekends, holidays, and 24/7 on call responsibilities on a rotational basis.
Recommended
Experience with RedHat OpenShift, Docker/Container, MS Azure, IaaS or PaaS solutions. Experience with CI/CD orchestration tools e.g., Jenkins, Maven or similar CI/CD. Experience with WebSphere Platform and application deployments. Proficient in Google Workspace applications, including Sheets, Docs, Slides, and Gmail. Successful internal candidates will have spent one year or more on their current team.
Other Conditions
Management will review the Job Analysis for this position prior to a job offer Seniority level
Seniority level
Executive Employment type
Employment type
Full-time Job function
Job function
Engineering and Information Technology Industries
IT Services and IT Consulting Referrals increase your chances of interviewing at Argyll Infotech Enterprise Pvt Ltd by 2x Get notified about new Platform Engineer jobs in
Dallas, TX . Sr. Software Engineer, Platform Orchestration - Slack
Dallas, TX $110,000.00-$160,000.00 3 weeks ago Irving, TX $83,912.00-$120,000.00 1 day ago Engineering - API Platform - Software Engineer - Analyst - Dallas
Dallas, TX $180,000.00-$198,000.00 3 weeks ago Plano, TX $92,500.00-$143,500.00 3 days ago Dallas, TX $174,000.00-$247,000.00 2 months ago Irving, TX $83,912.00-$115,080.00 1 day ago Plano, TX $120,000.00-$165,000.00 2 days ago Dallas, TX $104,000.00-$130,000.00 1 day ago Dallas, TX $100,000.00-$105,000.00 17 hours ago Dallas, TX $90,000.00-$115,000.00 1 day ago AWS Data Engineer - Fully Remote - US Only
Were unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI. #J-18808-Ljbffr
The Enterprise Monitoring (EM) team Platform Engineer will be responsible for the design, deployment and management of robust and scalable Monitoring and Logging platforms for the Enterprise. This role involves the execution, maintenance and delivery of new features/capabilities across multiple monitoring and observability solutions, to ensure end-to-end visibility into the health, performance and availability of applications and platforms. We are looking for a highly motivated, technically savvy, business focused Platform Engineer who has the skills to work in a highly collaborative and fast-paced environment and is willing to adapt to multiple strategic priorities to deliver high quality deliverables.
This position will be filled onsite in Issaquah, WA or Dallas, TX.
Job Duties/Essential Functions
Designs, implements and maintains monitoring platforms across multi-cloud (Azure, GCP etc) and on-prem environments. Manages and optimizes observability solutions (such as Dynatrace, Prometheus, OpenTelemetry etc) for end-to-end
systems visibility.
Assesses technical components, translates high level design into low level technical design and executes updates based
on SLAs.
Integrates logging solutions and ensures proper ingestion, parsing and indexing of logs across platforms. Administers Enterprise Monitoring tools such as SCOM, IBM Tivoli Monitoring, HP Operations Agent, Network Node
Manager (NNMi) and Dynatrace to ensure continuous availability and performance.
Develops automation scripts (e.g: Powershell, Python etc) to streamline alerting, onboarding and configuration tasks. Manages the platform on an ongoing basis while performing typical run functions like monitoring, patching and support. Leads and conducts code reviews, design reviews, testing, and debugging activities at the application level. Conducts regular platform check ups and reports inefficiencies to relevant business and technology stakeholders. Develops and instruments monitoring dashboards depicting platform health and performance. Develops and builds scalable and generalized frameworks to support the integration of internal and third-party APIs. Collaborates with DevOps, SRE, Cloud Engineering and Application teams to define and implement SLIs, SLOs, alerts, reports and dashboards. Troubleshoots monitoring gaps and issues across platforms and provide root cause analysis and resolution guidance to
teams.
Maintains platform upgrades, patching and configurations in line with compliance and security requirements. Participates in the creation of documentation and artifacts used to describe the mechanisms used for deployment,
monitoring, maintenance and best practices for platform usage, configuration and alert tuning.
Supports incident response, resolution and post-incident reviews for Production issues to identify failure points or
performance degradation.
Performs coding tests to validate hardware design correctness, and creates software regression tests to ensure its
reliability.
Conducts research and makes recommendations on standards, products, and services. Tests diagnostics (including) automated regression testing. Develops and executes integration testing plans (as needed). Evaluates and documents all operating systems according to required standards. Participates in the development of continuous integration/continuous development frameworks in support of DevOps and
Agile practices.
Regular and reliable workplace attendance at your assigned location.
Ability to operate vehicles, equipment or machinery
Computer, fax, phone, copier, printer
Non-Essential Functions
Assists in other areas of the department as necessary. Assists in other areas of the company as necessary.
Ability to operate vehicles, equipment or machinery
Same as Essential Functions.
Required
Experience, Skills, Education & Licenses/Certifications
Experience with Cloud Platforms, Saas Products, On-Prem solutions, and end to end development and delivery roles and
responsibilities.
Hands-on experience with Enterprise Monitoring tools such as Dynatrace, Google Cloud Logging, SCOM, IBM Tivoli, HP
Operations Agent, Network Node Manager (NNMi) or equivalent.
Working knowledge of Observability frameworks such as OpenTelemetry, Grafana, Prometheus, or equivalent. Working knowledge of data visualization tools such as MS PowerBI, Google Looker, etc. Proficiency in IaC scripting using tools such as Terraform or equivalent. Proficiency with scripting languages such as MySQL, PowerShell, Python, or equivalent. Proficient with OS utilities. Holds certifications in Cloud technologies such as Azure, Google Cloud Platform, or equivalent. Excellent verbal and written communication skills. Foundational networking knowledge. Excellent analytical skills and ability to effectively troubleshoot and provide solutions. Ability to work both independently and within a close team environment. Scheduling flexibility to meet the needs of the business, including weekends, holidays, and 24/7 on call responsibilities on a rotational basis.
Recommended
Experience with RedHat OpenShift, Docker/Container, MS Azure, IaaS or PaaS solutions. Experience with CI/CD orchestration tools e.g., Jenkins, Maven or similar CI/CD. Experience with WebSphere Platform and application deployments. Proficient in Google Workspace applications, including Sheets, Docs, Slides, and Gmail. Successful internal candidates will have spent one year or more on their current team.
Other Conditions
Management will review the Job Analysis for this position prior to a job offer Seniority level
Seniority level
Executive Employment type
Employment type
Full-time Job function
Job function
Engineering and Information Technology Industries
IT Services and IT Consulting Referrals increase your chances of interviewing at Argyll Infotech Enterprise Pvt Ltd by 2x Get notified about new Platform Engineer jobs in
Dallas, TX . Sr. Software Engineer, Platform Orchestration - Slack
Dallas, TX $110,000.00-$160,000.00 3 weeks ago Irving, TX $83,912.00-$120,000.00 1 day ago Engineering - API Platform - Software Engineer - Analyst - Dallas
Dallas, TX $180,000.00-$198,000.00 3 weeks ago Plano, TX $92,500.00-$143,500.00 3 days ago Dallas, TX $174,000.00-$247,000.00 2 months ago Irving, TX $83,912.00-$115,080.00 1 day ago Plano, TX $120,000.00-$165,000.00 2 days ago Dallas, TX $104,000.00-$130,000.00 1 day ago Dallas, TX $100,000.00-$105,000.00 17 hours ago Dallas, TX $90,000.00-$115,000.00 1 day ago AWS Data Engineer - Fully Remote - US Only
Were unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI. #J-18808-Ljbffr