Receive service requests and incident reports and log them in the system
Triage requests and incidents to respective groups
Monitoring
Monitor core ingestion and processing applications
Verify core ingestion and processing applications are running according to the schedule
Perform routine log analysis of core ingestion and processing applications to detect anomalies, and act to resolve them.
Start/stop/restart failed jobs where known issues exist and the runbook provides clear instructions for resolving them.
Verify external tool data access connections are operating as expected.
Diagnosis and problem analysis
Review application logs of long-running applications/jobs to identify the root cause
Review job run times of core ingestion and processing applications/jobs and verify that they meet expected SLAs. Document and report any skew against historical data.
Review incidents weekly for commonalities and recommend an approach to reduce the incident count. Support work efforts for addressing application issues.
Operations Manual Maintenance
Provide regular updates to the Operations Manual as required
Hiring criteria
You should have or be completing the following to apply for this opportunity.
Entry pathway
Degree or Certificate
Minimum Level of Study
Bachelor or higher
Study Field
Artificial Intelligence
Bioinformatics
Computer Graphics & Animation
Computer Science (all other)
Computer Systems and Networks
Cyber Security
Work rights
The opportunity is available to applicants in any of the following categories.