JAXenter: OpsRamp today announced the release of its platform for summer 2021. The highlight seems to be advanced predictive alerting capabilities. What does this mean and why is it important to your customers?

Michael Fisher: Alerts let you know when you have an issue or potential issue with your IT infrastructure and services. Our new predictive alerting capabilities put some intelligence behind these alerts by anticipating which regularly recurring alerts are likely to turn into performance-impacting incidents and which are primarily noise.

We can identify seasonal alert patterns and reduce downtime by anticipating when alerts will increase. And we can reduce incident volumes by scheduling repetitive alerts, freeing up your ITOps teams for more strategic tasks.
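
As a rough illustration of what identifying a seasonal alert pattern can look like, here is a minimal Python sketch. It is not OpsRamp's method: the function name `recurring_slots`, the `min_weeks` threshold, and the sample timestamps are all hypothetical, and a simple count of weekly repeats stands in for whatever model the product uses.

```python
from collections import defaultdict
from datetime import datetime


def recurring_slots(timestamps, min_weeks=3):
    """Return (weekday, hour) slots where alerts fired in at least
    `min_weeks` distinct ISO weeks. Hypothetical helper for illustration."""
    weeks_by_slot = defaultdict(set)
    for ts in timestamps:
        iso_year, iso_week, _ = ts.isocalendar()
        # Record which calendar weeks saw an alert in this weekday/hour slot.
        weeks_by_slot[(ts.weekday(), ts.hour)].add((iso_year, iso_week))
    return sorted(
        slot for slot, weeks in weeks_by_slot.items() if len(weeks) >= min_weeks
    )


# Made-up sample data: an alert every Monday at 02:00 for four weeks,
# plus a single one-off alert on a Wednesday afternoon.
sample = [
    datetime(2021, 6, 7, 2, 5),
    datetime(2021, 6, 14, 2, 10),
    datetime(2021, 6, 21, 2, 1),
    datetime(2021, 6, 28, 2, 30),
    datetime(2021, 6, 16, 15, 30),
]

print(recurring_slots(sample))
```

A slot that shows up here is a candidate for scheduling or suppression, while the one-off alert is left alone.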

JAXenter: Digging deeper into your new release, it looks like OpsRamp has expanded its monitoring capabilities to include auto monitoring, Alibaba cloud monitoring, and data center monitoring. What is happening to IT operations professionals today that has made you focus more on these areas?

Michael Fisher: The hybrid domain is growing, so we are expanding our monitoring capabilities. Alibaba Cloud may not be that popular in North America, but it’s the #1 public cloud platform (by market share) in Asia, a region projected to be a $300 billion internet economy by 2025. So if we want to support global customers and global business operations – and we do – we had to add support for that.

But not everything is moving to the public cloud. Data center technologies continue to advance and enable data center operators to deliver cloud functionality. So we’ve extended support for data center infrastructure like storage, networking, and hyperconverged infrastructure from vendors like Hitachi, VMware, Dell-EMC, and Poly.

JAXenter: One last question. What advice do you have for IT Ops professionals who want to move from being reactive in tracking and resolving incidents to being proactive, so that incidents don’t happen in the first place? Can you give them two or three good practices?

Michael Fisher: We advocate for a service-centric approach throughout the incident lifecycle. It starts with hybrid monitoring – a tool to monitor your entire application infrastructure wherever it is, on-premises and multi-cloud – with a mapping of business services to that underlying hybrid infrastructure.

From there, add intelligent event correlation with machine learning, so you don’t get drowned in alerts, but instead can identify when the same event triggered multiple alerts. You need to be able to route these alerts to the right people who can respond to them. And not all of these alerts require human intervention.
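
To make the correlation idea concrete, here is a minimal Python sketch. It is an assumption-heavy simplification: it replaces machine learning with a fixed rule (alerts from the same source within a time window belong to one event), and the `Alert` fields, the `correlate` function, and the 300-second window are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Alert:
    source: str       # hypothetical: the resource or service that raised the alert
    timestamp: float  # seconds since some reference point
    message: str


def correlate(alerts, window=300.0):
    """Group alerts so that alerts from the same source arriving within
    `window` seconds of the previous one are treated as a single event."""
    groups = []
    latest_group = {}  # source -> index of that source's most recent group
    for alert in sorted(alerts, key=lambda a: a.timestamp):
        idx = latest_group.get(alert.source)
        if idx is not None and alert.timestamp - groups[idx][-1].timestamp <= window:
            groups[idx].append(alert)
        else:
            groups.append([alert])
            latest_group[alert.source] = len(groups) - 1
    return groups


# Made-up sample: three database alerts in quick succession, one web alert,
# and a later, unrelated database alert.
alerts = [
    Alert("db-01", 0, "high latency"),
    Alert("web-01", 30, "5xx spike"),
    Alert("db-01", 60, "connection pool exhausted"),
    Alert("db-01", 120, "replication lag"),
    Alert("db-01", 1000, "high latency"),
]

groups = correlate(alerts)
print([len(g) for g in groups])
```

Each group can then be routed as one incident instead of several separate alerts.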

You should be able to initiate automated IT processes that can resolve incidents without human intervention, for example by automatically patching a vulnerability on the compute instance or by invoking an incident resolution policy.
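
One way to picture this is a lookup from alert type to an automated action, with anything unmatched escalated to a person. The sketch below is purely illustrative: the policy names, the alert dictionary shape, and the handler functions are hypothetical, and the handlers only return a description rather than touching real systems.

```python
def restart_service(alert):
    # Placeholder action: a real handler would call an orchestration tool.
    return f"restarted {alert['resource']}"


def apply_patch(alert):
    # Placeholder action: a real handler would trigger a patch job.
    return f"patched {alert['resource']}"


# Hypothetical resolution policies keyed by alert type.
POLICIES = {
    "service_down": restart_service,
    "vulnerability": apply_patch,
}


def handle(alert):
    """Run the matching automated policy, or escalate to a human if none exists."""
    action = POLICIES.get(alert["type"])
    if action is None:
        return ("escalated", f"routed to on-call for {alert['resource']}")
    return ("automated", action(alert))


print(handle({"type": "vulnerability", "resource": "vm-1"}))
print(handle({"type": "disk_full", "resource": "vm-2"}))
```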

Two-way integration with your ITSM tool is another good practice, so you can create, update, and close incidents in your ITSM tool as incidents unfold and gain better context to deal with future incidents.
