New Reliability Dashboards and new Performance Monitoring Features Follow

With Version 4.29 two major new feature were made available to you.

Introducing our new Reliability Dashboards!

We’re excited to be debuting our new Reliability Scoring capabilities that deepen enterprise visibility across both pre-production and production environments, and enable you to automatically identify and prioritize anomalies prior to a release in order to avoid promotion of bad code.

As the pace of software delivery quickens, the risk of poor application quality increases. We help development, QA and operations teams balance this speed-stability paradox by leveraging our unique real estate inside an application to detect anomalies that would otherwise go unnoticed by existing monitoring tools. Utilizing micro-agents that operate between application code and the hardware, we can capture data that was never previously available, then deduplicate, classify and gate critical issues from moving into staging and production.

Our new Reliability Dashboards help organizations visualize this data in out-of-the-box Grafana dashboards or via any tool of your choice via open REST APIs, so they can see at a glance where the issues are and drill in deeper into the cause with one click. Executives can see instantly if the quality of code is improving across teams, applications or services to identify areas of weakness that may require additional attention or resources.

Highlights of the new OverOps Reliability Dashboards include:

Reliability Scorecards and Release Certification

The OverOps Reliability Scorecard allows DevOps teams to observe the reliability of their environment at the highest level and triage critical issues that need attention. Each deployment, application and infrastructure tier is assigned its own dynamic score derived from the detection, classification and prioritization of all anomalies – including newly introduced errors, increasing errors and performance slowdowns.

Using these scores, organizations can certify releases to be moved through their delivery pipeline, or stop them in their tracks to proactively fix any issues. Through the new Jenkins integration, QA teams can see all new anomalies introduced by any release in test or stage, and automatically assign it a severity based on its potential impact to the code. OverOps will certify each release based on how many issues it introduced, and can automatically stop a bad release from being promoted, if it’s risky, sending it back to the engineers with True Root Cause.

True Root Cause Drill-Down

From the Reliability Scorecard dashboard, users can drill into the details of low scoring deployments, applications or infrastructure tiers, like AWS or RDBMs. The Reliability Analysis dashboard shows the corresponding anomalies and allows users to click straight into the True Root Cause screen using OverOps’ ARC AI technology, where they can view the code and variable state at the moment of an error across the entire call stack, as well environment state and DEBUG-level statements. With this complete context, QA, DevOps and SRE teams can easily route issues back to the right developer, arming them with all the context needed to fix the error – programmatic or operational.

Reliability Trends Over Time

The OverOps Reliability Trends dashboard provides a simple and effective way of comparing releases, or two instances of an application running on different nodes, to identify patterns. Building on this capability, the dashboard provides executives with an easy way to see how well their applications and deployments are doing over time with respect to error volume, unique error count, newly introduced or increasing errors, and slowdowns. At a glance, VPs and CXOs can see which applications are falling behind, as well as which teams require more attention. By understanding application quality release over release, executives can make informed decisions about resources and protect application revenue and customer experience.


For more details please see the following links: 

Reliability Dashboard Documentation 

The Reliability Dashboards

The Reliability Scorecard


To access our new dashboards click on the new   button on the top navigation toolbar.


Introducing our new Performance Monitoring Feature!

As of this release we are now automatically adding timers to measure transaction execution-time and locate slowdowns in your application.

These timers can be enabled in the Admin view called Performance Slowdowns by configuring our new UDF "Automatic entry point timers".

Any identified "slowdowns events" can be viewed from “My Timers” view in your Dashboard.
See Automatic Timers and Slowdown Snapshots for details!


For additional details see V4.29