Michael Sydor

External Post – What is an APM Suite uniquely useful for in the context of getting new clients?

Posted on March 28, 2019 by Michael Sydor

Originally Posted on Quora

There are really two client facing benefits that APM supports. The first is software quality: being able to show that you have visibility into the software life cycle (Dev, Test, Operate) and actively prevent performance problems from getting deployed. The earlier you can identify problems, the cheaper and quicker it is to resolve them. Your client needs confidence that your software systems, and development life cycle, are not going to negatively impact their business.

The second benefit, which is a direct function of the first, is operational stability: being able to prove that you can meet a contracted SLA (Service Level Agreement) for the overall availability and capacity of the software service they are contracting. Your client should always be willing to pay a premium for a reliable service, and your monitoring, as well as your APM discipline, is how you can different your service offering from a competitor’s.

Any sufficiently complex software system will have some unforeseen problems. It is almost unavoidable, given the rapid pace of enhancement, feature development and deployment. Your ability to employ APM technology to root out these problems proactively, to predict operational capabilities and capacity – this is the story you will tell your clients, in the process of negotiating an contract. Your practice with APM (how you employ the technology) can make a real difference – especially when your competition can not tell a similar story and nor show the same evidence.

External Post – What is a Performance Monitor

Posted on February 8, 2019February 8, 2019 by Michael Sydor

Originally Posted on Quora

In the IT (Information Technology) domain, it is a software system that assess the Availability, Performance and Capacity of the various IT subsystems, such as Mainframe, Database, Middleware, Web and Application Servers, and Network infrastructure (Routers, Firewalls, Switches, etc.)

The software system can be any combination of technologies: agent-based, agent-less (packet filtering), SNMP pings, transaction simulators, or logfiles.

Performance is itself consists of various Response Time measurements: component interactions, database calls, web services – essentially any kind of transaction that has a significant volume and measurable response time.

The challenge in using Performance Monitoring is knowing how to distinguish what is normal, and what is not. The process for this is to characterize the application under load and then survey to find the more frequent transactions that have a significant response time (> 1 msec). Putting these key transactions in to a monitoring group let’s you establish a normal behavior. And when the response time is too short, then use a capacity measure like invocation count.

Finding these key transactions can be accomplish over a week or two of production experience but is better done during QA performance testing where you have better control of inter-system variables and can potentially load the application until failure. A load-to-failure lets you identify the bottlenecks in the application and very often, the key transactions are different under crush load than nominal load.

In a modern enterprise, there are potentially hundreds of IT components that comprise a complex application or service. Figuring out what components are responsible for a degradation of service or outright failure can be difficult or impossible without the visibility that Performance Monitoring provides. Not all applications need full performance monitoring and it is usually reserved for revenue bearing systems or Tier-1 applications.

For many other web services/applications, especially those that are dynamically clusters and multi-site, Performance Monitoring is nice to have but not mandatory. In these situations, loosing a few instances, here and there, is no big deal. But if you want to optimize your clustering costs, or enhance service reliability and customer experience – then Performance Visibility is an essential tool.

You manage what you measure. //ws-na.amazon-adsystem.com/widgets/q?MarketPlace=US&OneJS=1&Operation=GetAdHtml&ServiceVersion=20070822&ad_type=product_link&asins=B004I5BNEA&bg_color=FFFFFF%22%3E+++++%3C%2Fiframe%3E&linkId=350d7dc3274d0f44221644576ee601f8&link_opens_in_new_window=false&marketplace=amazon&placement=B004I5BNEA&price_color=333333&ref=tf_til&region=US&show_border=false&source=ac&title_color=0066C0&tracking_id=spyderjacks-20“>APM best practices

What is the main difference between IT and software engineering?

Posted on August 18, 2017 by Michael Sydor

This topic is a useful jump point for conversations around identifying Best Practices, and making their application reliable and consistent across the enterprise. This is what The APM Practice is focused on – helping to ensure that your organization’s DNA lives on, as new tools and technologies are brought to bear in the evolution of your enterprise.

Originally posted on Quora

IT (Information Technology) is about managing the application of computing technology (hardware, software and networking) to business problems and environments. In the not so distant past, absolutely everybody had some capability, ranging from the college student that kept your PC’s up to date (software and hardware), to hundreds of individuals overseeing and contributing to the implementation and operation of the computing resources of a large enterprise. It required significant budgets, planning and project management to keep up with the changing technology, development and testing, and operation of the resulting physical plant.

Software Engineering is about reliably building systems of software that full-fill business/commercial and personal requirements. It is largely independent of any specific hardware or environment (platform independence) and a single individual can literally “change the world” with a novel application, and with little more than investment than their time to gain the expertise.

With the world literally moving to cloud-based computing resources and applications, the need for private computing centers is fading, and along with it the ‘profession’ of IT. The efficiencies and automation of cloud operations have a single individual responsible for the same work that literally hundreds of staff were required – and that was only a few years ago.

Even the concept of a computer workstation is shifting to mobile and assistants like Alexa. Still early days, for sure but the glory days of the CIO and an army of IT staff are over and done.

What remains is to preserve the useful practices of IT as a foundation for enhanced automation – the lessons learned and tools developed (that are not already commercialized or open-sourced). Everything that IT performed, as individual agents in a large and hierarchical organization, becomes subsumed into the practice and domain of the Software Engineer, and their specialist roles in Systems, Architecture and Security.

What is Infrastructure Performance Management

Posted on August 16, 2017 by Michael Sydor

https://www.quora.com/What-is-Infrastructure-Performance-Management

External Post – Building the “Death Star” of legacy APM tech?

Posted on August 1, 2016January 29, 2017 by Michael Sydor

If you are going to do Enterprise APM, you need a full APM stack: logs, commands, synthetics, real user and byte code instrumentation. There is always an advantage in going with a single vendor vs. ‘best-of-breed’ and the consolidation is just a reflection of the industry maturation. Sure, it might look like the “Death Star” but it is still smaller the hundreds of apps present in an established enterprise – even if you limit your APM to tier-1 services.

Regards the “declining shares” of the big players, I think this has more to do with the decline in infrastructure management (IM) vs. growth of APM. You become proactive in your performance management by catching problems pre-production – rather than precise measurements of the crater you make in production.

External Post – Requirements – do you have to start at the beginning of the application lifecycle?

Posted on July 1, 2016 by Michael Sydor

Requirements are nice but an APM initiative need not begin at the start of a new application life-cycle. Reality is that a menu of apps are available; some mature, some problematic, some of little consequence – and the very few ready to begin with a ‘greenfield’ of clean requirements. You need only assess what proportions are on the ‘menu’ and you can devise a program that will bring them all into good alignment.

Most important is to actually begin with the ‘stable’ applications so that you can practice your deployment and configuration of APM and learn what your stakeholders really want for performance metrics. Then move on to problematic and ‘greenfield’ apps.

Every client environment is different and simply taking an academic and rational approach to performance requirements is going to leave a lot of gaps – and support will evaporate. Better to get going now and implement some visibility quickly and show what you can do – rather than what would be ideal to have.

What is the motivation to write about Best practices?

Posted on October 5, 2015 by Michael Sydor

The very last thing I did at CA – and I mean right before I turned in my badge, was to do a Q&A about why I wrote the APM Book and what is means for CA, customers and the industry at large. I never checked if it was actually used… and now I find it on the YouTube! CA Press Author Series: Mike Sydor

So if you have missed having me on a conference call or leading a customer initiative, you can re-live the magic. All done in one take with the video team pleading: “please don’t move – we keep getting glare from your glasses” – Arg! I can still feel the cramps.

You can find more links on the book and related articles and publications here.

Cheers.

External Post – Two principle metrics you must monitor for any application

Posted on March 21, 2015 by Michael Sydor

Two principal metrics you must monitor for any Application

For sure, transaction volumes and response times are the first two metrics to consider.

But that is not what makes APM a complex undertaking.

Everybody, for years, has been MONITORING volumes and response time. Actually DOING SOMETHING with the data – this is the real gap.

There are many organizational impediments that conspire to bog down the sharing of this critical information. The real focus of APM (where the ‘M’ == management) is to facilitate collaboration on the data. Getting the data to the right people in the organization and giving them a mechanism to effect the change needed to improve or restore performance – this is the gap that the APM vendors still need to work on addressing.

Monitoring without paying attention to collaboration – and you miss the APM value. Nobody picks up a “hammer” and instantly knows how to build a house! The tools today are great ‘hammers’…

Monitoring + Organizational Processes (Collaboration) – and APM delivers. No matter whose ‘hammer’ you happen to have.

External Post – Don’t count out Synthetics from your APM strategies

Posted on March 12, 2015 by Michael Sydor

Don’t count out Synthetics from your APM strategies

For sure, synthetics are an important strategy, especially after hours when traffic is too low for real-transaction volume to be consistent. But if synthetics are all you have – you’re going to be in trouble! Those marketing sites are apparently not being managed. There isn’t much point in monitoring if there is no one to respond to the incident!

Any APM strategy has to allow for those apps/services that are initially unmanaged and could quickly be illuminated with log monitoring or synthetics. For tier-2 and tier-3 apps – this could be all they ever need. But for any app/service of significance you need to quickly follow up with deeper visibility – to whatever that implementation can support. Our first job is to get some visibility, any way we can. Our second job is to start using this information to improve the software service – not simply alert that it has ‘hit the fan’. Otherwise, you end up with a practice that measures “craters” with great precision – but never learns how to avoid the accidents in the first place.

External Post – Completely Automatic Performance Regression!

Posted on March 12, 2015 by Michael Sydor

Completely Automatic Performance Regression!

I always use a load profile that ramps-up for 5 minutes, then steady state for 10-20 minutes and then ramps down for 5 minutes. I need to focus my analysis on that steady-state period . How do I achieve that with your tool set?
Thanks.

The APM Practice

Best Practices, Artifacts and Tutorials for Application Performance Management (APM)

Author: Michael Sydor