Job Description :
JD :
Data Analytics Architect with 10 years+ experience in ENTERPRISE MONITORING & REPORTING.
Programming experience with GO, PYTHON, PERL.
Experience in Analytics, MONITORING, ALERTING & REPORTING AND TIME SERIES DATABASE - PROMETHEUS, GRAFANA, ELASTIC SEARCH etc.
PromQL language to scrape metrics
Deploy automated monitoring processes to Collect Metrics For All Data Pipelines using standard tools and technologies - e.g. PROMETHEUS, GRAFANA
A strong mixture of languages such as Core Java, Python, and familiar with these technologies: KUBERNETES, AWS, SPRINGBOOT, ELASTIC SEARCH, PROMETHEUS, GRAFANA, JAEGER, GRAPHITE.
Should be well versed in CLOUD DEPLOYMENTS (TERRAFORM/CLOUD FORMATION), operating systems (namely CENTOS) AND RELIABILITY (N+1 ARCHITECTURES, DATA BACKUP, MONITORING OF MONITORING
Work activities would also include :
Understand the client application stack / data sources and data elements
DESIGN, IMPLEMENT AND OPTIMIZE QUERIES AND DATA MODELS
Works with client teams to Refine Observability And Metrics
Define the targets to be scraped and the time-interval for Scraping Metrics
DEFINING THE ALERT RULES to fire alerts at appropriate TIMES USING PROMETHEUS ALERT MANAGER
Help the team with analyzing, identifying, and Tuning Dashboards
Create Modular /Scripted inputs
Create COMMON INFORMATION MODEL FOR DATA
Create required KNOWLEDGE OBJECTS FROM THE PROMETHEUS DATA
DATA SCRAPING, COLLECTING AND STORING THE DATA LOGS INTO PROMETHEUS
DATA INSTRUMENTATION to extract information from metrics using various exporters
CREATE ALERTS, define thresholds and triggers using Prometheus
INTEGRATE GRAFANA WITH PROMETHEUS AND CREATE DASHBOARDS as per the use case
Continuous monitoring of applications and servers for exceptions, server CPU & memory usage, or storage spikes
Monitor the operational characteristics and collect metrics for all the data pipelines using Grafana. Ensuring the statistics remains within acceptable limits
Setting up Prometheus server with Node Exporter and Grafana to visualize Centos OS key stats
Defining and developing meaningful metrics by querying from the Prometheus’s time-series database using PromQL
             

Similar Jobs you may be interested in ..