Weblogic Authors: Yeshim Deniz, Elizabeth White, Michael Meiner, Michael Bushong, Avi Rosenthal

Related Topics: Weblogic

Weblogic: Article

Holistic Infrastructure Monitoring and Management

Holistic Infrastructure Monitoring and Management

WebLogic Server, like most applications, provides robust and detailed monitoring tools bundled with the basic application. The embedded monitoring and management provided by the WebLogic Console is extremely useful when diagnosing and repairing a problem once it has been isolated in the WebLogic Server. But this embedded point solution is of limited use in most real-world situations where the application server is just a single component in a system of components that are all vital to providing the end-user application. When trying to quickly diagnose a general problem with the end-user application, it is much more powerful and effective to view data from each component holistically as a part of the system rather than evaluating each component by itself.

This holistic view becomes even more powerful when correlated with end-user application and business transaction performance. If business priorities and processes can be included in this view of the infrastructure, operations management becomes more valuable to the enterprise.

Point Solutions
A typical Web infrastructure is composed of a multitude of hardware and software components: hosts running various operating systems, Web servers, application servers, databases, legacy systems, and network devices. Each element of the infrastructure usually includes a point monitoring and management solution.

The command line utilities available to operating systems in the Unix family are typical point solutions. Point solutions provide power and flexibility to the experienced administrator. At the application level, vendors package tools for the monitoring and management of their applications. The WebLogic Server Console provides detailed information about many aspects of WebLogic (current connections, JVM memory usage, database connection pool load, etc.).

The common benefit of these point solutions is that they enable real-time system monitoring, troubleshooting, and resolution by the experienced administrator. Moreover, these tools are ultimately used to solve problems once they have been identified.

One difficulty with point solutions, however, lies in their heterogeneity. It is not always self-evident to an experienced Unix administrator how to identify an errant process on a Windows platform, although the operation is nearly identical to that used on Unix. Similarly, an administrator familiar with WebLogic Server Console might have difficulty using a Web server or another application server's point solution. Point monitoring solutions demand a high level of specialization and domain expertise from administrators.

A second difficulty is in their dispersion. With point solutions there is no visibility across the components, which can be of various types. Unfortunately, an operational problem with WebLogic Server may be related to an underlying problem with the host, the database server it's connected to, or the network layer. Because specialists are hired to fix particular problems, each using their own tools, problem isolation is slow and labor intensive, especially in large and highly specialized IT organizations.

A third drawback to point solutions is that they tend to be reactive. They're used when the system is already down or performing poorly and the administrator is responding in real time to the problem as the business suffers the consequences. Lacking an aggregate tool, inventive teams have crafted scripts and other tools to facilitate these activities, but the maintenance of these tools requires even more expertise.

Infrastructure as a Whole
Infrastructure managers quickly realized that the information they were getting from their various point solutions was much more valuable when viewed together as part of a larger whole. They realized that fault isolation and correlation became much easier once they had this larger view. In the network world, SNMP became the de facto standard for aggregating all of the element data and enabled master management consoles.

Suddenly, infrastructure managers could do more to support the enterprise because they were spending less time in the isolation phase of problem resolution. Infrastructure management was still principally reactive, but it was faster. For the most part, though, these goals were only achieved at the network layer. State models of IP-based networks are much easier to understand and manipulate than what is happening at the application level (see Figure 1).

As network management solutions attempted to integrate infrastructure management and monitoring above OSI Layer 3, the problem of resolving a common view of the health of the infrastructure from disparate data sources becomes even more pronounced. Application vendors extended SNMP support to their products (WebLogic, for example, can be monitored via SNMP) as an attempt to enable a common infrastructure view.

But SNMP can suffer from reliability and complexity issues, as well as being incompletely supported by some components in the infrastructure. Access to command-line utilities and Windows tools like Performance Monitor also remain critical in the complete management of infrastructure resources. Also, databases typically make most statistics available only via SQL.

Agent-based systems are often proposed as an alternative comprehensive monitoring technique, but are difficult to manage and tax system resources unnecessarily. They also often lack access to some of the components you might need.

Common access to system information from heterogeneous sources is vital. Most existing approaches to consolidation involve a Manager of Managers (MoM) system, which receives monitoring data from multiple sources. These systems tend to be prohibitively expensive to purchase, customize, and support. They also take considerable time to implement and require significant training to be used effectively.

NOCpulse Command Center uses a plug-in framework to separate monitoring data from the access methods used to gather that data, providing the benefits of the agent-based and MoM systems without suffering the disadvantages. Lacking a cross-industry standard to reconcile the very different issues encountered when monitoring an application like WebLogic versus an operating system (and such a standard seems unlikely, even considering the distant future), a flexible, extendable product-based standard (like NOCpulse Command Center's plug-in framework) is the next best thing.

Also key is a common data repository and interface. Command Center plug-ins access required metrics via multiple protocols, but the results are presented in a common format via a single Web-based user interface. Performance metrics collected from the infrastructure are gathered in a common data store, allowing easy data mining, historical event correlation, and root cause analysis through a shared report engine (see Figure 2).

The result is a holistic view of all the components that make up an end-user application: up the stack from the operating system through the application layer to the network and vertically from server to server.

Infrastructure and the End User
Too often the focus of monitoring is attention to system problems without regard to the real end-user impact. What is ultimately important is not the specific health of all of the individual components of an Internet infrastructure, but the performance of the application at the other end. Can our customers currently purchase a CD from our site? Is our billing system too slow? Does the customer service section of our site offer the help our customers need? The era of Web applications has given rise to point solutions for end-user monitoring. These products either quantify end-user experience or monitor site accessibility.

Unfortunately, the limited end-user monitoring approach ignores the infrastructure. Web site slow? Administrators go to another solution to solve the problem. Web infrastructures grow quickly in complexity; an administrator might not be able to expediently and effectively correlate end-user performance issues with a particular component of their infrastructure.

Holistic monitoring requires a common interface to both infrastructure health and end-user application performance. NOCpulse Command Center provides this common interface and allows a user to model a multi-step browser-based transaction through a point-and-click configuration tool for both remote and local monitoring.

It may be of interest to see the performance of my e-commerce site at a sample of locations on the public network. But this needs to be triangulated against local performance. Customers were unable to place orders for a time. Do I need to complain to my network provider or do I need to scale up my Web server farm? Point end-user monitoring solutions cannot answer this question; a holistic approach that correlates user experience to infrastructure health can.

Infrastructure and the Enterprise
Once we have a single view of the infrastructure that can be correlated to the performance of our end-user applications and business transactions, resources can be efficiently applied to quickly solve problems. The infrastructure can be tuned to provide better performance with a strong feedback loop of the metrics that matter, ensuring that changes actually have the intended effect. Service Level Agreements can be managed proactively; problems become defined by what the end user is experiencing or by metrics associated with the business transaction rather than by an arbitrary definition of "problem" at the infrastructure level.

The final stage in the development of infrastructure management involves connecting business management to the infrastructure. It involves making the priorities and concerns of the business transparent within the view of the infrastructure. Now precious operations resources are not only able to resolve problems quickly, but they can be applied efficiently to where they matter most: to the most urgent problem, where urgency is determined by business priorities. Fundamentally, it amounts to quickly getting the right people, with the right tools and information, to the most important problem (as defined by the priorities of the business).

For example, we might have problems with two applications: our CRM system is down and our credit card approval process is running slowly. From an operational perspective, the first problem may seem more severe, but when tied to business priorities, the risk of lost revenue mandates that effort be applied to the second problem first. A truly holistic management solution enables these types of decisions automatically.

NOCpulse Command Center allows users to build arbitrary groups of components that correspond to business processes, transactions, customers, or end-user applications. The behaviors of each of these groups (thresholds, notification destinations, escalation procedures, etc.) can be set in accordance with the importance of each group to the business. Critical customer issues get raised and resolved while lower priority or tolerable problems wait until people are free to deal with them.

Fundamentally, infrastructure management adds more and more value to the enterprise as it evolves from an inefficient, slow, reactive, element-focused approach to an efficient, responsive, proactive approach that is able to see infrastructure holistically and understand the relative importance of each end-user application to the business. When that level is achieved, infrastructure management becomes a true business enabler, allowing service level management, efficient customer problem reporting and resolution, and prioritization of fault response in accordance with business priorities. These benefits require a holistic operational view that provides the ability to correlate what is happening at the infrastructure level with what is happening at the business level.

More Stories By Greg Peters

Greg Peters, VP of Engineering at NOCpulse,
has over 10 years of experience in software development,
system engineering, and program management. He leads
the company's core product development strategy and
engineering teams.

More Stories By Lance Peterson

Lance Peterson, program manager at NOCpulse, is responsible
for interface design and engineering program management.
Previously, he worked as a programmer
analyst at Emory University and in an earlier life
taught literature, film, and [email protected]

Comments (0)

Share your thoughts on this story.

Add your comment
You must be signed in to add a comment. Sign-in | Register

In accordance with our Comment Policy, we encourage comments that are on topic, relevant and to-the-point. We will remove comments that include profanity, personal attacks, racial slurs, threats of violence, or other inappropriate material that violates our Terms and Conditions, and will block users who make repeated violations. We ask all readers to expect diversity of opinion and to treat one another with dignity and respect.

@ThingsExpo Stories
SYS-CON Events announced today that Yuasa System will exhibit at the Japan External Trade Organization (JETRO) Pavilion at SYS-CON's 21st International Cloud Expo®, which will take place on Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. Yuasa System is introducing a multi-purpose endurance testing system for flexible displays, OLED devices, flexible substrates, flat cables, and films in smartphones, wearables, automobiles, and healthcare.
SYS-CON Events announced today that Taica will exhibit at the Japan External Trade Organization (JETRO) Pavilion at SYS-CON's 21st International Cloud Expo®, which will take place on Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. Taica manufacturers Alpha-GEL brand silicone components and materials, which maintain outstanding performance over a wide temperature range -40C to +200C. For more information, visit http://www.taica.co.jp/english/.
Enterprises have taken advantage of IoT to achieve important revenue and cost advantages. What is less apparent is how incumbent enterprises operating at scale have, following success with IoT, built analytic, operations management and software development capabilities – ranging from autonomous vehicles to manageable robotics installations. They have embraced these capabilities as if they were Silicon Valley startups. As a result, many firms employ new business models that place enormous impor...
SYS-CON Events announced today that SourceForge has been named “Media Sponsor” of SYS-CON's 21st International Cloud Expo, which will take place on Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. SourceForge is the largest, most trusted destination for Open Source Software development, collaboration, discovery and download on the web serving over 32 million viewers, 150 million downloads and over 460,000 active development projects each and every month.
SYS-CON Events announced today that Dasher Technologies will exhibit at SYS-CON's 21st International Cloud Expo®, which will take place on Oct 31 - Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. Dasher Technologies, Inc. ® is a premier IT solution provider that delivers expert technical resources along with trusted account executives to architect and deliver complete IT solutions and services to help our clients execute their goals, plans and objectives. Since 1999, we'v...
As popularity of the smart home is growing and continues to go mainstream, technological factors play a greater role. The IoT protocol houses the interoperability battery consumption, security, and configuration of a smart home device, and it can be difficult for companies to choose the right kind for their product. For both DIY and professionally installed smart homes, developers need to consider each of these elements for their product to be successful in the market and current smart homes.
SYS-CON Events announced today that MIRAI Inc. will exhibit at the Japan External Trade Organization (JETRO) Pavilion at SYS-CON's 21st International Cloud Expo®, which will take place on Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. MIRAI Inc. are IT consultants from the public sector whose mission is to solve social issues by technology and innovation and to create a meaningful future for people.
SYS-CON Events announced today that Massive Networks, that helps your business operate seamlessly with fast, reliable, and secure internet and network solutions, has been named "Exhibitor" of SYS-CON's 21st International Cloud Expo ®, which will take place on Oct 31 - Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. As a premier telecommunications provider, Massive Networks is headquartered out of Louisville, Colorado. With years of experience under their belt, their team of...
SYS-CON Events announced today that TidalScale, a leading provider of systems and services, will exhibit at SYS-CON's 21st International Cloud Expo®, which will take place on Oct 31 - Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. TidalScale has been involved in shaping the computing landscape. They've designed, developed and deployed some of the most important and successful systems and services in the history of the computing industry - internet, Ethernet, operating s...
Widespread fragmentation is stalling the growth of the IIoT and making it difficult for partners to work together. The number of software platforms, apps, hardware and connectivity standards is creating paralysis among businesses that are afraid of being locked into a solution. EdgeX Foundry is unifying the community around a common IoT edge framework and an ecosystem of interoperable components.
Coca-Cola’s Google powered digital signage system lays the groundwork for a more valuable connection between Coke and its customers. Digital signs pair software with high-resolution displays so that a message can be changed instantly based on what the operator wants to communicate or sell. In their Day 3 Keynote at 21st Cloud Expo, Greg Chambers, Global Group Director, Digital Innovation, Coca-Cola, and Vidya Nagarajan, a Senior Product Manager at Google, will discuss how from store operations...
In a recent survey, Sumo Logic surveyed 1,500 customers who employ cloud services such as Amazon Web Services (AWS), Microsoft Azure, and Google Cloud Platform (GCP). According to the survey, a quarter of the respondents have already deployed Docker containers and nearly as many (23 percent) are employing the AWS Lambda serverless computing framework. It’s clear: serverless is here to stay. The adoption does come with some needed changes, within both application development and operations. Tha...
SYS-CON Events announced today that IBM has been named “Diamond Sponsor” of SYS-CON's 21st Cloud Expo, which will take place on October 31 through November 2nd 2017 at the Santa Clara Convention Center in Santa Clara, California.
In his Opening Keynote at 21st Cloud Expo, John Considine, General Manager of IBM Cloud Infrastructure, will lead you through the exciting evolution of the cloud. He'll look at this major disruption from the perspective of technology, business models, and what this means for enterprises of all sizes. John Considine is General Manager of Cloud Infrastructure Services at IBM. In that role he is responsible for leading IBM’s public cloud infrastructure including strategy, development, and offering ...
Infoblox delivers Actionable Network Intelligence to enterprise, government, and service provider customers around the world. They are the industry leader in DNS, DHCP, and IP address management, the category known as DDI. We empower thousands of organizations to control and secure their networks from the core-enabling them to increase efficiency and visibility, improve customer service, and meet compliance requirements.
Join IBM November 1 at 21st Cloud Expo at the Santa Clara Convention Center in Santa Clara, CA, and learn how IBM Watson can bring cognitive services and AI to intelligent, unmanned systems. Cognitive analysis impacts today’s systems with unparalleled ability that were previously available only to manned, back-end operations. Thanks to cloud processing, IBM Watson can bring cognitive services and AI to intelligent, unmanned systems. Imagine a robot vacuum that becomes your personal assistant tha...
With major technology companies and startups seriously embracing Cloud strategies, now is the perfect time to attend 21st Cloud Expo October 31 - November 2, 2017, at the Santa Clara Convention Center, CA, and June 12-14, 2018, at the Javits Center in New York City, NY, and learn what is going on, contribute to the discussions, and ensure that your enterprise is on the right path to Digital Transformation.
Recently, REAN Cloud built a digital concierge for a North Carolina hospital that had observed that most patient call button questions were repetitive. In addition, the paper-based process used to measure patient health metrics was laborious, not in real-time and sometimes error-prone. In their session at 21st Cloud Expo, Sean Finnerty, Executive Director, Practice Lead, Health Care & Life Science at REAN Cloud, and Dr. S.P.T. Krishnan, Principal Architect at REAN Cloud, will discuss how they b...
SYS-CON Events announced today that mruby Forum will exhibit at the Japan External Trade Organization (JETRO) Pavilion at SYS-CON's 21st International Cloud Expo®, which will take place on Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. mruby is the lightweight implementation of the Ruby language. We introduce mruby and the mruby IoT framework that enhances development productivity. For more information, visit http://forum.mruby.org/.
Digital transformation is changing the face of business. The IDC predicts that enterprises will commit to a massive new scale of digital transformation, to stake out leadership positions in the "digital transformation economy." Accordingly, attendees at the upcoming Cloud Expo | @ThingsExpo at the Santa Clara Convention Center in Santa Clara, CA, Oct 31-Nov 2, will find fresh new content in a new track called Enterprise Cloud & Digital Transformation.