Operations Engineer RealMed
Availity delivers revenue cycle and related business solutions for health care professionals who want to build healthy, thriving organizations. Availity has the powerful tools, actionable insights and expansive network reach that medical businesses need to get an edge in an industry constantly redefined by change.
The ideal candidate will have maintained proprietary applications in a high-availability production environment and have the ability to rapidly self-educate on new concepts and methods.
In Availity?s role as a Health Information Network provider, connecting health care providers with health plans and enabling the smooth flow of information is paramount. Careful monitoring of the availability and performance of systems at the infrastructure, middleware and application levels is a crucial piece of finding and fixing issues ? before they impact our customers.
The focus of this role will be on implementing and maintaining these monitoring and alerting solutions. You should be passionate about high availability, high service performance systems, and enjoy being a front-line responder for analysis and troubleshooting of critical issues impacting hundreds of thousands of users.
Principle: 100% customer satisfaction
Responsible for planning, designing, and implementing the appropriate monitoring and alerting solutions needed to ensure the stability, integrity, and efficient operation of enterprise information systems.
Monitor system and application performance from bandwidth to availability to response times
Work closely with engineering teams to understand the operational and performance needs of new products
Heavy involvement in all aspects of 24/7 production data-center operations
Participate in on-call rotation as needed
Investigate and recommend ways to more elegantly/efficiently enhance processing time, reliability, scalability and ease of deployment
Produce and maintain documentation on installations, incidents, SOP?s, and FAQs through Confluence
Contribute to planning efforts for disaster recovery, capacity expansion, component upgrading and system hardening
Maintain data center operation procedures in collaboration with Engineering staff and Client Services staff
WORK EXPERIENCE & SKILLS (Required)
5+ years of operations experience with large-scale, high-availability distributed systems
Expert hands-on knowledge of Linux and scripting (Bash, Python, Perl, etc)
Strong knowledge of Unix/Linux Operating systems
Strong troubleshooting, problem-solving and analytical skills
Proven track record troubleshooting critical issues on complex, high-traffic systems
Ability to analyze and optimize all major aspects of server-side performance
Ability to implement solid script-based solutions for system tasks (PXE, kickstart, etc)
Self-starter who manages their own priorities and activities
Excellent organizational skills and discipline to follow through
Ability to communicate technical information to non-technical personnel
Experienced in troubleshooting various Systems
You love working on small teams with a lot of responsibility and several different hats
You're able to tackle complex issues with a strong sense of urgency and ownership
WORK EXPERIENCE & SKILLS (Preferred)
Experience in technical project management relating to the implementation of enterprise applications.
Deep knowledge of monitoring and related tools (IT360, Splunk, IPMI, Nagios, Trac, JIRA, etc)
Performance testing and tuning experience
Software development experience a plus
Industrial engineering / continuous improvement / lean six sigma experience a plus
EDUCATION AND CERTIFICATION (Required)
Degree/diploma in Information Systems or related industry with 2 years additional experience.
EDUCATION AND CERTIFICATION (Preferred)
BS/MS in CS or Engineering discipline
||Jacksonville, FL |