While some think about the late Austrian musician, Dan POP and the CNCF community thinks about modern security when it comes to Falco.
Listen in and hear directly from Dan (@danpopnyc) who, besides doing many things in the CNCF community, also hosts POPCAST where he started connecting technology leaders during the last year. In the podcast you learn a lot about security, the power of eBPF and how Falco aims to contribute to runtime security like k8s contributed to distributed computing.
Here the additional links we brought up during the conversation:
Dan on Linkedin
Dan on Twitter
Cyber Defenders Career Guide by Alyssa Miller
Cloud Native TV
CNCF Tag Security
Falco Tools, Frameworks & Articles
Falco Der Kommissar
Dynatrace recently announced Grail – promising boundless observability, security and business analytics in context. You may think: that’s a lot of nice words that other solutions claim as well. So why should you care about Grail? What is the real problem it solves and how does it solve it? Tune in and hear from Andreas Lehofer, Chief Product Officer at Dynatrace as he boils it down to two critical issues: * Cost vs Value of your data: Current approaches are expensive as you keep 95% of your data not knowing whether you ever need it! * Functional Limits with having siloed observability data: When you need answers the current siloed approach is slow and limited! Thanks Andreas for the discussion, the insights on the hidden costs of current approaches, the technical explanation on our architecture as well as giving us some glimpse on what’s coming next. Show Links: Dynatrace Grail Announcement: https://www.dynatrace.com/platform/grail/ Andreas Lehofer on Linkedin: https://www.linkedin.com/in/andreaslehofer/
21 November 2022 • 43m and 10s
“I was not that interested in coding but more in understanding the impact of software on human beings” says Diana Najda, SRE & Monitoring Lead, when we asked her how she ended up leading the efforts around Site Reliability Engineering. Tune in to our conversation and learn how Diana is bridging the gap between Dev, Ops and Business by ensuring that the right people get the right telemetry data from their observability platform. She gives us insights into her definition of DevOps and SRE, how she helps teams setting up SLOs (Service Level Objectives) and how she proves the ROI (Return On Investment) into the SRE practices! Last piece of advice Diana gives everyone interested: “SRE might be buzzword it loses the buzz the more you hear it – BUT - its really cool because SREs make the life of Dev and Ops easier every day” If you want to connect with Diana reach her on LinkedIn: https://www.linkedin.com/in/diannajda/
7 November 2022 • 42m and 18s
Serverless and other emerging technologies hide the complexity of the underlying runtimes from developers. This is great for productivity but can make it really hard when troubleshooting behavior that needs deeper insight into those runtimes, platforms or frameworks. In this episode we hear from Kam Lasater, Founder of Cyclic Software. Kam has run into several walls while he was implementing solutions from scratch using Serverless technologies as well as other popular cloud services. He recently presented a handful of those scenarios at DevOpsDays Boston 2022. Tune in and learn from Kam as he walks us through two of those challenges he covered during his DevOpsDays talk. If you want to learn more make sure to watch the full talk on YouTube: https://www.youtube.com/watch?v=xB9vsSl93mE If you want to learn more from or about Kam check out the following links: YouTube video from DevOpsDays Boston: https://www.youtube.com/watch?v=xB9vsSl93mE Cyclic Website: https://www.cyclic.sh/ Cyclic Blog: https://www.cyclic.sh/blog/ Twitter: https://twitter.com/seekayel Personal Website: https://kamlasater.com/ LinkedIn: https://www.linkedin.com/in/kamlasater/
24 October 2022 • 48m and 33s
Over the years we learned how to optimize the performance of our JVMs, our CLRs or our databases instances by tweaking settings around heap sizes, garbage collection behavior or connection and thread pools. As we move our workloads to k8s we need to adapt our optimization efforts as they are new nobs to turn. We need to factor in how resource and request limits on pods impact your application runtimes that run on your clusters. Out of memory problems are all of a sudden no longer just depending on the java heap size alone! To learn more about k8s optimization best practices we have invited Stefano Doni, CTO of Akamas. Stefano walks us through key learnings as the team at Akamas has helped organizations optimize the performance, resiliency and cost of their k8s workloads. You will learn about proper memory settings, CPU throttling and how to start saving costs as you move more workloads to k8s. To learn more about Akamas go here: https://www.akamas.io/ If you happen to be at KubeCon 2022 in Detroit make sure to visit their booth Show Links: Stefano on Linkedin: https://www.linkedin.com/in/stefanodoni/ A Guide to Autonomous Performance Optimization with Dynatrace and Akamas: https://www.youtube.com/watch?v=i7MuEjeOvX0
10 October 2022 • 43m and 7s
In economic turbulent times leaders get asked questions like: “What’s the return on investment of your DevOps or Cloud Transformation? Did we really get better and more efficient? Or did we just blow a lot of money out the window?” Connecting business results with your technical initiatives is what would answer those questions. To learn how this works we invited Adam Dahlgren, SVP Product at Allstacks. From Adam we learn about Value Stream Management, how to align with your top level OKRs and how to improve your DORA and SPACE metrics. Because as Adam says in the beginning: “Inspection is coming especially during turbulent economic times and they will question your investment in transformation projects!” If you want to follow up with Adam check out the following links we discussed: LinkedIn: https://www.linkedin.com/in/adam-dahlgren/ What are DORA Metrics: https://www.allstacks.com/blog/dora-metrics/?hsLang=en What is the SPACE Framework: https://queue.acm.org/detail.cfm?id=3454124 Allstack: https://www.allstacks.com/ DevOps World sessions from Allstack: https://events.devopsworld.com/widget/cloudbees/devopsworld22/conferenceSessionDetails?tab.day=20220929&search=dora
26 September 2022 • 46m and 34s
We all want to leverage technology to solve problems. New and shiny toys are appealing to look which sometimes means we loose the insights on the base technologies that powers most of our connected lives, such as DNS or TLS. In this podcast we invited Philipp Krenn (@xeraa), Dev Advocate Team Lead at Elastic, and learn about DNS, TLS and other bad config changes. We learn about Log4Shell, how the Java Security Manager was a big help in fighting Log4Shell, why its been deprecated and also get his thoughts into CDD (Conference Driven Development) And if you ever visit Vienna – chances are you meet Philipp dancing Waltz with tourists 😊 Show Links: To learn more from Philipp start with His personal website: https://xeraa.net/ Twitter: https://twitter.com/xeraa LinkedIn: https://www.linkedin.com/in/philippkrenn His conference schedule (past & future): https://xeraa.net/events/
12 September 2022 • 47m and 56s
How do you a design a feature if you don’t know for whom it is for? How do you define SLOs (Service Level Objectives) if you don’t know what your users expect from you? How do you design performance tests and workloads if you don’t know which user behavior to simulate? In this episode we have Barbara Ogris, Sr Product Experience Designer at Dynatrace, who walks us through the concept of target personas that she helped establish within Dynatrace. It changes product and observability discussions from “as a user I want …” towards “as Archie I have this need …”. Listen in and learn about design thinking, using empathy maps to define your target persona and how this can be applied to many aspects in software engineering. Barbara on Linkedin https://www.linkedin.com/in/barbara-ogris-6a0b6011b/ Dynatrace Blog: Terminology matters: how to enhance user experience by aligning names with expectations https://www.dynatrace.com/news/blog/terminology-matters-how-to-enhance-user-experience-by-aligning-names-with-expectations/ Atlassian persona template: https://www.atlassian.com/software/confluence/templates/persona MIRO persona template: https://miro.com/aq/ps/templates/personas/?utm_source=google&utm_medium=cpc&utm_c[…]aIQobChMI6J7Juqql-QIVCOJ3Ch3jtwWwEAAYASAAEgLxT_D_BwE&loc=9062705 Adobe XD: how to define a persona: https://xd.adobe.com/ideas/process/user-research/putting-personas-to-work-in-ux-design/
29 August 2022 • 37m and 1s
SRE vs DevOps, SRE or DevOps or is it SRE & DevOps? No better person to ask than somebody that has been an SRE for much longer than our industry is talking about Site Reliability Engineering. Michael Wildpaner, Sr Engineering Director Cloud Security at Google, started as an SRE for Google Maps back in 2006. Fast forward to 2022 Michael has a lot of hands-on experience about the SRE role, the different levels of SRE that one organization can apply and how it connects with DevOps. Tune in and hear his personal stories from more than 15 years at Google. While not everyone is Google – there for sure is a lot we can take out of this conversation. Here some of my personal take aways Core idea of SRE: take engineers that understand distributed systems and “annoy” / guide developers to build better resilient systems from the start Design for automation: this already starts with naming your infrastructure (aka – don’t use lord of the rings names) SREs help so that you DO NOT DESIGN yourself into a corner Observability is the foundation of good SRE as it enables incident management, insights all the way up to user insights Tip: Ensure new hires understand that you have a blameless culture As follow up material check out those links LinkedIn: https://www.linkedin.com/in/michael-wildpaner Talk at DevOps Fusion 2022: https://devops-fusion.com/en/speaker/michael-wildpaner/
15 August 2022 • 53m and
For some out there SLOs (Service Level Objectives) are the silver bullet to building and operating reliable software. But nothing is as shiny on the inside as it looks on the outside. In this episode we invited Stephen Townshend, former Performance Engineer now converted to Site (Slight) Reliability. Stephen (@the_kiwi_sre) has experienced the tough side of establishing SLOs within an organization. It’s a constant battle between focusing on reliability and new features and a lack of change in culture. Listen in and learn about the 9 pre-requisites for SLOs that Stephen has identified such as: having a certain level of observability, define clear business objectives, define ownership and give autonomy or establishing a blameless culture Stephen on Linked in https://www.linkedin.com/in/stephentownshend/ Stephen on Twitter https://twitter.com/the_kiwi_sre Here the additional resources we brought up during our talk: Slight Reliability YouTube: https://www.youtube.com/c/SlightReliability Slight Reliability Podcast: https://www.buzzsprout.com/1698445 Our LinkedIn discussion: https://www.linkedin.com/posts/scottmooreconsulting_7-steps-to-identify-and-implement-effective-activity-6938919857459462144--RI7 LinkedIn: https://www.linkedin.com/in/stephentownshend/ Twitter: https://twitter.com/the_kiwi_sre
1 August 2022 • 46m and 37s
Security is everyone’s business. And as everyone seems to be moving to Cloud Native it's important to understand what the security landscape in k8s, containerized apps, serverless, … looks like. To learn more about this we invited Anais Urlichs (@urlichsanais), Developer Advocate at Aqua Security and CNCF Ambassador of the year 2021. Over the past years Anais has educated thousands of people on cloud native, devops and security on her YouTube Channel. Tune in and learn more about the different approaches to security in cloud native, which open source projects are out there and how her advise on embedding security in your day2day work. Some additional links we discussed can be found here: Anais on Linkedin: https://www.linkedin.com/in/urlichsanais/ Anais on Twitter: https://twitter.com/urlichsanais Trivy: https://github.com/aquasecurity/trivy Weekly DevOps Newsletter: https://anaisurl.com/ WTFisSRE Talk: https://www.youtube.com/watch?v=0zL61AiOaK0 Anais’s YouTube channel: https://www.youtube.com/c/AnaisUrlichs Aqua Open Source YouTube Channel: https://www.youtube.com/channel/UCb4mfRT5UWpjoUQRcIE2qOQ
11 July 2022 • 44m and 26s
While this episode started out with a recap of April Edwards (@TheAprilEdwards) keynote called “Putting the Ops into DevOps” we quickly got April talk about what measures Microsoft has set to embrace the cultural change needed for their DevOps transformation: Every service has a public health dashboard, putting the customer in the center, make products open source, eat your own dog food, align your objectives with the team, … Besides this great conversation that finally gave some great input on what cultural change really looks like we learned from her background in Ops, moving to Dev, getting into the cloud and now inspiring Ops teams to have it easier in their job using automation. Tune in, learn and get inspired. We also talked about the late Abel Wang and how Microsoft UK is supporting Girls Who Code. Show Links: April on Linkedin https://www.linkedin.com/in/azureapril/ April on Twitter https://twitter.com/TheAprilEdwards Putting the Ops into DevOps keynote https://globalazure.at/sessions/#323994 Supporting Girls Who Code in memory of Abel Want https://www.justgiving.com/fundraising/msbuild2022/?WT.mc_id=modinfra-67727-apedward
27 June 2022 • 48m and 46s
Feature Flagging has gained a lot of momentum which we can observe by counting the number of feature flagging solutions. To ensure a good developer experience when implementing feature flags the CNCF OpenFeature project was launched during KubeCon 2022 in Valencia. It is aiming to provide a feature flag standard similar to what OpenTelemetry did for Observability. Tune in to this podcast where we have two of the founding members Mike Beemer and Todd Baert explain why it was the right time to initiate the project, which problems it solves and what use cases feature flagging brings to organizations. If you want to learn more about the project check out the following resources discussed during the podcast WebSite: https://openfeature.dev/ GitHub: https://github.com/open-feature Community: https://github.com/open-feature/community ITPro Today Launch Coverage: https://www.itprotoday.com/testing-and-quality-assurance/open-source-openfeature-project-takes-flight-advance-feature-flags
13 June 2022 • 43m and 44s
How do you plan for unplanned work such as fixing systems when they unexpectedly break in production? Just like firefighters – the best approach to practice those situations so that you are better prepared when they happen. In this episode we have Mandi Walls, DevOps Advocate at PagerDuty, explain why she loves Game Days where she is “practicing for the weird things that might happen”. Prior to her current role she worked for Chef and AOL – picking up a lot of the things she is now advocating for. In our conversation Mandi (@lnxchk) gives us insights into how to best prepare and run game days, shared her thoughts on what good chaos scenarios (unreliable backend, slow dns …) are and which health metrics (team health, # incidents out of hours, …) to look at in your current incident response to figure out what a good game day scenario actually is. Mandi on Linkedin: https://www.linkedin.com/in/mandiwalls/ In our talk we mentioned a couple of resources – here they are: Mandi’s talk at DevOpsDays Raleigh: https://devopsdays.org/events/2022-raleigh/program/mandi-walls Ops Guides: https://www.pagerduty.com/ops-guides/
30 May 2022 • 47m and 29s
“The most significant body of my SRE work is architectural reviews, disaster and failover planning and help with SLIs and SLOs of applications that would like to become SRE supported.” This statement comes from Hilliary Lipsig, Principal SRE at Red Hat, as her introduction to what the role of an SRE should be. Hilliary and her teams are helping organizations getting their applications cloud native ready so that the operational aspect of keeping a system up & running and within Error Budgets can be handled by an SRE Team. Listen in to this episode and learn about the key advices she has for every organization that wants to build and operate resilient systems. And understand why every suggestion she makes has to be and will always be evidence-based! In the talk we mentioned a couple of tools and practices. Here are the links: Hilliary on Linkedin: https://www.linkedin.com/in/hilliary-lipsig-a5935245/ KubeLinter: https://docs.kubelinter.io Listen to talk Helm and Back again: an SRE Guide to choosing from DevConf.cz: https://www.youtube.com/watch?v=HQuK6txYS3g
16 May 2022 • 50m and 58s
The world is slowly moving back to having on-site meetings and conferences – such as DevOpsDays in Raleigh, NC where Andi presented on “Oh Keptn, my Keptn”. Besides presenting Andi also visited several organizations on his road trip through North Carolina and Texas. Listen in and learn what the adoption challenges of DevOps & SRE are, how to define good SLOs (Service Level Objectives) and how to explain the difference between containers and microservices. Also check out the following links Brian and Andi discussed: State of SRE Report: https://www.dynatrace.com/info/sre-report/ DevOpsDays Raleigh: https://devopsdays.org/events/2022-raleigh/program/andreas-grabner SLOConf: https://www.sloconf.com/ WTFisSRE: https://www.cloud-native-sre.wtf/ Keptn: https://www.keptn.sh
2 May 2022 • 35m and 23s
OpenTelemetry, for some the biggest romance story in open source, as it took off with the merger of OpenCensus and OpenTracing. But what is OpenTelemetry from the perspective of a contributor? Listen to this episode and here it from Daniel Dyla, Co-Maintainer OTel JS and W3C Distributed Tracing WG, and Armin Ruech who is on the Technical Committee focusing on cross language specifications. They give us insights into what it takes to contribute and drive an open source projects and give us an update on OpenTelemetry, the current status, what they are working on right now as well as the near future improvements they are excited about. Show Links: The OpenTelemetry Project https://opentelemetry.io/ Daniel Dyla https://engineering.dynatrace.com/persons/daniel-dyla/ Armin Ruech https://engineering.dynatrace.com/persons/armin-ruech/ List of instrumented libraries https://opentelemetry.io/registry/ Contribute to OTel https://opentelemetry.io/docs/contribution-guidelines/ OpenTelemetry Tutorials on IsItObservable https://isitobservable.io/open-telemetry
18 April 2022 • 52m and 56s
When moving to the cloud - have you thought of the performance difference between App Gateway and Application Load Balancers? The disk speed and disk cache limitations impacting Cassandra and or Elasticsearch Performance? Challenges with pre-built containers or resource limits on pods impacting Java Garbage Collection behavior? These are all performance considerations Klaus Kierer, Senior Software Engineer in the Cluster Performance Engineering Team at Dynatrace, has learned over the past months as he helped performance optimize the Dynatrace Platform as it was expanded from running on AWS Compute to run on Kubernetes hosted in Azure (AKS) or Google Cloud (GKE). Listen in and learn why Performance Engineering is more important than ever as you are moving your workloads to the “hyper-hybrid-cloud”. Show Links: Klaus on Linkedin: https://www.linkedin.com/in/klaus-kierer-67b83a81/ Blog - When to use Azure Load Balancer or Application Gateway: https://blog.siliconvalve.com/2017/04/04/when-to-use-azure-load-balancer-or-application-gateway/ K8ssandra performance benchmarks on cloud managed Kubernetes https://k8ssandra.io/blog/articles/k8ssandra-performance-benchmarks-on-cloud-managed-kubernetes/
4 April 2022 • 56m and 33s
Lift and Shift seems to be “the easiest” cloud migration scenario but can quickly go wrong as we hear from Brian Chandler, Principal Sales Engineer at Dynatrace, in this episode. Tune in and learn how latency can be the big killer of performance as you partially move services to the cloud. Brian (@Channer531) also reminds us about why you have to know about the N+1 query problem and the impact in cross cloud scenarios. Last but not least – Brian gives us insights into why Uber might be one of those companies who can change the SRE & SLO culture within partnering organizations. Show Links: Brian Chandler on Linkedin https://www.linkedin.com/in/brian-chandler-8366663b/ Brian Chandler on Twitter https://twitter.com/Channer531
21 March 2022 • 38m and 25s
Steve Tack has been leading Dynatrace Product Management for the past 10 years. He was one of the few Dynatracer’s delivering the key product announcements from Perform 2022 live from Vegas this year. In this session we recap the key product announcements, which breakouts to watch and which keynotes you don’t want to miss. To make it easier to follow up follow these links: On Demand sessions from Dynatrace Perform 2022 Product Announcements: Observability in Multi-Cloud Serverless, Software Intelligence as Code, DevSecOps Automation Alliances, Real-Time Security Attack Blocking Show Links: Steve Tack on Linkedin https://www.linkedin.com/in/stevetack/ Dynatrace Perform On-demand videos https://perform.dynatrace.com/2022-global Observability for Multicloud Serverless Architectures https://www.dynatrace.com/news/press-release/dynatrace-delivers-the-industrys-most-complete-observability-for-multicloud-serverless-architectures/ Software Intelligence as Code https://www.dynatrace.com/news/press-release/dynatrace-delivers-software-intelligence-as-code/ DevSecOps Automation Alliance Partner Program https://www.dynatrace.com/news/press-release/dynatrace-launches-devsecops-automation-alliance-partner-program/ Real-Time Attack Detection and Blocking https://www.dynatrace.com/news/press-release/automatic-detection-and-blocking-of-attacks/
7 March 2022 • 49m and 56s
Do you regularly go to the gym or are you just wearing your sweat pants and sneakers at home and think that will do it? Or how about agile practices? Do you think by religiously attending your daily standup your colleagues think your performance testing all of a sudden happens within each sprint? Leandro Melendez (aka Senor Performo), a DevRel Advocate for k6 load testing, tells us what he has seen in organizations he is helping to transform their performance engineering practices. The true benefit of becoming Agile, DevOps or whatever the next buzz word is, is to enable engineers with automated performance feedback on their changes. Listen in and also learn about Leandro’s ideas on PDD (Performance Driven Development). Show Links Leandro on Linkedin https://www.linkedin.com/in/leandromelendez/ Señor Performo Site https://www.srperf.com/ K6 Load Testing https://k6.io/
14 February 2022 • 48m and 12s
“Whether open source or commercial – just focusing on logs, traces and metrics is limiting our conversation and missing the point what observability really is!”, says Dotan Horovits, Tech Evangelist at Logz, in his opening statement in this podcast. Listen an and learn more about why observability is not about collecting data. Observability is rather a data analytics problem as it needs to give humans answers to DevOps, SRE and Business questions. To learn more beyond what was discussed in this podcast listen in to OpenObservability Talks, stay up to date on OpenTelemetry or follow Dotan at @horovits Show Links Dotan Horovits on Linkedin https://www.linkedin.com/in/horovits/ Open Observability Talks https://openobservability.io/ Open Telemetry Project https://opentelemetry.io/ Dotan Horovits on Twitter https://twitter.com/horovits
31 January 2022 • 46m and 50s
Who said that automation and data-driven decisions is only for DevOps or SREs? Scalability challenges or quality constraints are just as important to digital marketers like Nina Tollefson, Art Director at Dynatrace. The pandemic caused many events to transform to a fully virtual. That was also true for Dynatrace’s flagship annual global user conference Perform in February 2021. To successfully transform and deliver a flawless experience for Perform attendees, Nina had to embrace automation in her field of work. Just as DevOps engineers Nina updated her tool chain. Figma was a new digital marketing platform of choice to automate, improve collaboration, become more efficient and scale her digital marketing work to support Perform. Today she can cash in on the automation investment she made last year as Perform 2022 is about to start. Thanks to data-driven digital marketing automation it looks like another amazing experience for our global Dynatrace user base. Thanks, Nina, for sharing your story and allowing us to draw many parallels to the DevOps & SRE world. Show Links Nina on Linkedin https://www.linkedin.com/in/nina-tollefson/ Dynatrace Perform https://www.dynatrace.com/perform-2022/
17 January 2022 • 28m and 50s
Log4Shell was an unwelcome early Christmas present for many IT teams around the globe. Asad Ali, Senior Director Dynatrace Sales Engineering, was involved starting December 9th – helping organizations around the world to react to the new vulnerability threat. In our discussion we learn how the vulnerability works technically, how runtime AppSec vulnerability detection eliminates false/positives and how teams around the globe are preparing to protect their software supply chain for future vulnerabilities. To learn more about Log4Shell check out Dynatrace’s Log4Shell Resource Center including educational blogs as well as Asad’s webinar on Detecting and Remediating Log4Shell with Dynatrace webinar. Show Links Asad Ali on Linkedin: https://www.linkedin.com/in/alikingdom/ Dynatrace's Log4Shell Resource Center https://www.dynatrace.com/resource-center/log4j-vulnerability Detecting and Remediating Log4Shell with Dynatrace https://info.dynatrace.com/global-all-wc-dynatrace-for-log4shell-18339-od-fulfillment.html
4 January 2022 • 44m and 4s
Encore Presentation - we'll be back in early 2022, until then, here is one of our favorite recent episodes: To k8s or not – that should be the first question to answer before considering k8s. Granted – in many cases k8s is going to be the right choice but don’t just default to k8s because its hip or cool.
20 December 2021 • 52m and 22s
The rise of smart phones clearly created a new demand for “instant gratification” when it comes to interacting with online services through web sites or apps. To ensure services are available at any point in time without any interruption or delay it requires performance engineers to automate performance and scalability engineering into the development processes. In this episode we invited Mike Kobush, Performance Engineer at NAIC, to hear how he found his way in quality engineering and later on got hooked on performance. He walks us through current performance challenges as his organization is moving to the cloud. Mike also discusses why he is embracing automation as it makes him more desirable as an employee. He shares his goals and motivations such as: Learning something new every day! Tune in and get inspired by Mike. And remember: it always pays off to spend the extra time with somebody – even if it’s a Sunday afternoon in sunny San Diego! Show Links: Mike on Linkedin https://www.linkedin.com/in/michael-kobush-31-performance/
6 December 2021 • 47m and 2s
What are good business level objectives (BLOs) besides conversion rates? Who is responsible for defining them? Who needs to report and who is held accountable? We invited John Kelly, Sales Engineer at Dynatrace, to answer those and even more questions. John – aka Tech Shady - has been helping our customers over the past years to implement business level reporting for their critical applications. It was exciting to hear that there is much more than your classical availability or conversion rate business metrics. The one we think is really exciting is Engagement Rate. So – tune in and learn for yourself Show links: John Kelly on Linkedin https://www.linkedin.com/in/john-kelly-b22b992/ John Kelly on Twitter https://twitter.com/JohnKelly17
22 November 2021 • 52m and 15s
Java developers love using Spring. But running high performing and scaling Java apps in production takes a little bit more than just compiling your code. In this episode we have Asir Vedamuthu Selvasingh who has been working with Java for 26 years. In the past 25 years Asir (@asirselvasingh) helped Microsoft provide services to their developer community that make it easier to deploy, run and operate Java based applications at scale – nowadays on the Azure Spring Cloud offering. Listen in and learn more about observability when deploying apps on the Azure Cloud, which performance and scaling aspects you have to consider and get a look behind the scenes on how your packaged java app magically becomes available across the globe. Show Links: Asir on Linkedin https://www.linkedin.com/in/asir-architect-javaonazure/ Asir on Twitter https://twitter.com/asirselvasingh Monitoring Spring Boot Apps with Dynatrace https://docs.microsoft.com/en-us/azure/spring-cloud/how-to-dynatrace-one-agent-monitor Observability on Azure Spring Cloud https://www.dynatrace.com/news/blog/dynatrace-extends-observability-to-azure-spring-cloud/
8 November 2021 • 41m and
If there is one thing you take away from this episode then the answer to “Why we should refrain from Reply All on company wide emails”. Jokes aside – as security and performance are not always funny! In this special anniversary episode we have Mark Tomlinson, System Performance Specialist, talking about the considerations and trade-offs between performance and security. We learn about performance vulnerabilities and why it is important to factor in the additional overhead each layer of security adds to your application stack. It's always a pleasure having Mark on the show – whether it was in the past, present or will be in the future. If you want to learn more from Mark on the topic of performance make sure to check out PerfBytes that has inspired us to launch PurePerformance. Show Links: Mark Tomlinson on Linkedin https://www.linkedin.com/in/mtomlins/ PerfBytes https://www.perfbytes.com/
25 October 2021 • 49m and 25s
Optimizing or debugging database calls has to become as easy as optimizing your application code based on logs, metrics or traces your observability platform provides to developers. It has to be doable by the development and DevOps teams who are becoming more end-2-end responsible which includes new database services that are running in some managed cloud service. In this episode we hear from Nimesh Bhagat, Product Manager at Google, how modern database observability supports development and DevOps teams to better understand, optimize and operate their end-2-end service flow. A great project Nimesh has been working on is sqlcommenter which uses OpenTelemetry to continue distributed traces started in the application into the internals of the database engine. If you want to learn more check out the sqlcommenter documentation or the Google Podcast on Cloud SQL Insights. Show Links Nimesh on Linkedin https://www.linkedin.com/in/nimesh-bhagat-b062354/ SQLCommenter https://cloud.google.com/blog/products/databases/sqlcommenter-merges-with-opentelemetry SQLCommenter Documentation https://google.github.io/sqlcommenter/ Google Podcast on Cloud SQL Insights https://www.gcppodcast.com/post/episode-247-cloud-sql-insights-with-nimesh-bhagat/
11 October 2021 • 38m and 15s
If you need to learn how Prometheus, OpenTelemetry, Loki, FluentD, FluentBit .. help you with your observability requirements in the cloud native and non-cloud native space but you don’t have hours or days to dig into the details yourself then you have a new place to go to get educated within 20-30 minutes: Is it Observable is a new educational YouTube channel by Henrik Rexed, Cloud Native Advocate at Dynatrace. In this episode we have Henrik explain the motivation of creating Is it Observable, how to best use the videos and the tutorials on GitHub to educate yourself and gave a glance on upcoming episodes that will also include guests from various tool and platform vendors. Make sure to subscribe to his channel, check out the tutorials on GitHub and give him feedback on twitter (@hrexed) or LinkedIn Show Links Is It Observable YouTube Channel: https://www.youtube.com/channel/UCRiin4u8YZGlVQRZp7qOOaw Henrik Rexed on Linkedin https://www.linkedin.com/in/hrexed/ Is It Observable Github Repo https://github.com/isItObservable Henrik Rexed on Twitter https://twitter.com/Hrexed
27 September 2021 • 43m and 50s
Tuning the JVM GC to reduce garbage collection time will speed up application performance. If you agree with that statement then I encourage you to listen to this episode where I have Stefano Doni, CTO at Akamas, walk us through 4 Java Tuning Facts & Myths. He is going into details why even in 2021 with great improvements in the JVM it is still important to optimize the JVM specific to the environment, workload and application behavior. If you want some visuals try to catch his presentation from this years Performance Summit called “How AI optimization will debunk 4 long standing Java tuning myths” To follow up with Stefano check out their resources such as their tutorials on explore.akamas.io, check out their blog posts or videos on their new website akamas.io or follow them on twitter @akamaslabs Links from the Show: Stefano on Linkedin https://www.linkedin.com/in/stefanodoni/ YouTube: How AI optimization will debunk 4 long-standing Java tuning myths https://www.youtube.com/watch?v=9VvaxATyYsA Akamas Tutorials https://explore.akamas.io/ Akamas website https://www.akamas.io/ Akamas on Twitter https://twitter.com/AkamasLabs
13 September 2021 • 40m and 46s
Understanding the secret behind the turbo button on his first 486 PC motivated our guest to study computer science. That decision started a journey making him constantly learn new technology ranging from coding languages, operational tasks as well as a focusing on improving developer experiences and boosting developer productivity Listen in and hear from Michael Friedrich (@dnsmichi), The Ops in Dev Evangelist at GitLab, on why it is important to enable developers to design and develop code that makes it easier for DevOps and SREs to operate and automate. “The biggest challenge is code that breaks production but where there is no clear evidence for DevOps & SREs about the root cause” Make sure to join Michael’s #EveryCanContribute and follow his advocacy such as DockerCon 2021 on From Infrastructure as Code to Cloud Native Deployments in 5 Minutes Links from show: Linkedin https://www.linkedin.com/in/dnsmichi/ Twitter https://twitter.com/dnsmichi Everyone Can Contribute https://everyonecancontribute.com/ From Infrastructure as Code to Cloud Native Deployments in 5 Min https://docker.events.cube365.net/dockercon-live/2021/content/Videos/emEjNyA4WmBSv8BW2
30 August 2021 • 54m and 14s
“Because 9 out of 10 load testing projects fail due to ignorance and outdated thinking about load testing!”. That was the answer Leandro Melendez, aka Señor Performo, gave us when asking him why the world needs yet another book about load testing. In too many projects Leandro has to remind and educate decision makers and practitioner’s about load testing best practices, how to ask the right questions and how to approach a project from start to finish. His book “The hitchhiking guide to load testing projects” is a fun and edu-taining read for people that are new to the trade as well as seasoned performance engineers. For more content from Senor Performo check out his PerfBytes Espanol Podcast, his Spanish YouTube channel and all the performance engineering presentations he has been given over the years. https://www.srperf.com/podcast/ https://www.srperf.com/el-youtube-channel/ https://www.srperf.com/presenter/ Pre-order the book here https://www.amazon.com/dp/B09C4ZT1LB
16 August 2021 • 50m and 2s
Building products that people want to use and activating users to try out new capabilities has to be the ultimate goal of every product manager. User and usage data is the enabler to make the right decisions. But data doesn’t come for free – and making the right decisions is something that data alone doesn’t guarantee Listen in and learn from Manav Chugh, product enthusiast, medium blogger and organizer of ProductTank Linz, what inspired him to choose the path from Zero to Data-Driven Product Manager. In our conversation we cover how to capture what data, the importance of data privacy, what we can learn from companies that do data-driven design well and why he loves organizations such as www.ecosia.org. Links from show: Manav on Linkedin https://www.linkedin.com/in/manavchugh/ Manav's Medium Blog https://manav77-chugh.medium.com/ ProductTank Linz https://www.mindtheproduct.com/producttank/linz Ecosia Search Engine https://www.ecosia.org/
2 August 2021 • 53m and 13s
Like many frontend developers, Sergey Chernyshev was inspired in the late 2000 by Steve Souders to contribute to and grow the web performance community. Not only did he launch the Meed4SPEEDs as part of the New York Web Performance Meetup. He also worked for meetup.com helping them to improve web performance and user experience. Over the past years Sergey contributed to many projects such as WebPageTest.org, UX Capture Library, Jamstack and more. Tune in to this episode, learn what has and what hasn’t changed in the quest for better user experience. Get a quick start on Core Web Vitals and current challenges. Most important: get inspired to contribute back to this community. Llinks we discussed during the podcast are here: Sergy on LinkedIn https://www.linkedin.com/in/sergeychernyshev/ Steve Souders Page: https://stevesouders.com/ NY Web Performance Meetup: https://www.meetup.com/Web-Performance-NY/ WebPageTest: https://www.webpagetest.org/ Github UX Capture: https://github.com/ux-capture/ux-capture Jamstack: https://jamstack.org/ WebVitals: https://web.dev/vitals/ CrUX: https://developers.google.com/web/tools/chrome-user-experience-report Edge Compute: CloudFlare Workers: https://workers.cloudflare.com/ Fastly: https://www.fastly.com/products/edge-compute/serverless PWA Stats: https://www.pwastats.com/ Next.JS approach to SSR: https://nextjs.org/docs/basic-features/pages
19 July 2021 • 58m and 52s
In his SLOConf talk Production load testing as a guardrail for SLOs and in his blog Production Load Testing, Hassy Veldstra, founder of artillery.io makes the case for load testing in production. It helped him in various organizations to establish SLOs (Service Level Objectives) and change the way engineers think about performance. He got inspired by Building Evolutionary Architectures which introduces the concept of performance as a fitness function. Tune in into our conversation, hear our arguments pro and contra load testing in the various environments and learn why in the end we agreed on the fact that SLOs – while nothing really new – are a great chance to re-define performance engineering. Linkedin https://www.linkedin.com/in/hveldstra/ SLOconf: Production load testing as a guardrail for SLOs - by Hassy Veldstra https://www.youtube.com/watch?v=Y20K1mJB6tk Blog: Load testing. In production. http://veldstra.org/production-load-testing/ Artillery Website https://artillery.io/ Book: Building Evolutionary Architectures https://www.thoughtworks.com/books/building-evolutionary-architectures
5 July 2021 • 46m and
How do you convince an organization that just went through a 2 year DevOps transformation to continue the journey by applying SRE practices? What is SRE anyway? What are good SLOs? And how do you get development teams to take responsibility for their code in production? Bart Enkelaar, Lead Site Reliability Engineer at bol.com, not only got their organization to apply SRE practices, define good SLOs and got dev teams to rotate on-call duties. He also followed the advice of Margaret, Chief Platform Officer, to bring his personal passion to the job. This led to inspiring and educating the community about SRE and SLO through music. To see what I mean check out Barts The Game of SLOs – a three part reliability musical from SLOConf or his funny tech conversations at Friendly Tech Chats. Linkedin - https://www.linkedin.com/in/bart-enkelaar-02242710/ Margaret, Chief Platform Officer - https://www.youtube.com/watch?v=hy1gUEhbnBM Game of SLOs: A 3 part reliability musical - https://www.youtube.com/watch?v=Y53Pho93i-k Friendly Tech Chats - https://www.youtube.com/channel/UChHHWkO537q6Yp2dXtJpOzQ/featured
21 June 2021 • 29m and 55s
Wonder what you learn when building k8s from scratch for a large enterprise? Wonder what you learn when automating delivery by connecting your different DevOps tools together? Nana Janashia runs one of the most successful technical YouTube channels called TechWorld with Nana where she covers topics ranging from containers, docker, k8s, cloud native and DevOps. She basically takes her lessons learned and explains technologies and concept in a very easy way especially for folks that want to get started with. In this episode we focus a lot on DevOps, what the right trades of a DevOps engineer are and how to get started. Thanks Nana for your time and all the additional resources we talked about during the episode that are listed below: DevOps Bootcamp: https://www.techworld-with-nana.com/devops-bootcamp DevOps Roadmaps for Humans with Bret Fisher: https://www.youtube.com/watch?v=UXf2c76KAyA DevOps Roadmap: https://roadmap.sh/devops Linkedin Profile: https://www.linkedin.com/in/nana-janashia/ TechWorld with Nana YouTube channel: https://www.youtube.com/channel/UCdngmbVKX1Tgre699-XLlUA
24 May 2021 • 49m and 25s
Have you ever thought about reorganizing data allocation based on production telemetry data? Have you ever thought about shifting compiler budgets to parts of your code that is heavily executed based on profiling information captured from your real end users? Whether the answer is yes or no you will be fascinated by Taras Tsugrii, Software Engineer at Facebook, who is sharing his experience on optimizing everything from compilers, to databases, distributed systems or delivery pipelines. If you want more after listening to this episode check out his recent talk at Neotys PAC titled “Old pattern powering modern tech”, subscribe to his substack newsletter, his hashnode blog, or the conference recordings of Performance Summit and Scaling Continuous Delivery. https://www.linkedin.com/in/taras-tsugrii-8117a313/ https://www.youtube.com/watch?v=itOCQvk_LAs https://softwarebits.substack.com/ https://softwarebits.hashnode.dev/ https://www.youtube.com/channel/UCt50fEvgrEuN9fvya8ujVzA https://www.youtube.com/channel/UCWf9HxiBudKLzCFtgAAz8XQ
10 May 2021 • 58m and 44s
Googles Census, OpenCencus, OpenTelemetry and AWS Distro for OpenTelemetry. Our guest Jaana Dogan, Principal Engineer at AWS, has been working in observability over many years and definitely had a positive impact on the where OpenTelemetry is today. In this episode Jaana (@rakyll) explains which problems the industry, and especially cloud vendors, try to solve with their investment in open source standards such as OpenTelemetry. She gives an update where OpenTelemetry is, the next upcoming milestones such as metrics and logs and what a bright future with OpenTelemetry being widely adopted could bring. https://twitter.com/rakyll If you are interested in learning more – here are the links we discussed during the podcast: https://github.com/open-telemetry https://github.com/open-telemetry/opentelemetry-specification https://github.com/open-telemetry/opentelemetry-proto https://github.com/open-telemetry/opentelemetry-collector https://github.com/open-telemetry/community https://o11yfest.org/
26 April 2021 • 51m and 31s
Performance Engineering is not about running a performance test twice a year. That is just a poor attempt trying to validate your non functional requirements. Roman Ferstl, Managing Directory at Triscon, has discovered his love for performance engineering while optimizing code for software used in a space program. He then founded Triscon who is now helping to establish and scale performance engineering at large enterprises. In this episode we get his insights on how he approaches a new project, which bottlenecks to address first and how to motivate more people within an organization to invest in performance engineering. If you want to learn more don’t miss to check out Roman’s presentation from Perform 2021 titled “Turbocharging your Performance Engineering teams to scale efficiently” https://www.linkedin.com/in/roman-ferstl/ https://www.triscon-it.com/en/ https://perform.dynatrace.com/2021-americas/breakouts-single-day-3-turbocharging-your-performance-engineering-teams
12 April 2021 • 54m and 2s
To k8s or not – that should be the first question to answer before considering k8s. Granted – in many cases k8s is going to be the right choice but don’t just default to k8s because its hip or cool. In this episode we have Christian Heckelmann (@wurstsalat), DevOps Engineer at ERT, talking about his journey with k8s which started with installing k8s 1.9 on bare metal. He gives a lot of great advice based on his presentation “How not to start with k8s” such as Understand Networking, Don’t use :latest, Set Resource Limits, Train The People, Provide Templates and more. To get started with Kubernetes we encourage you to look at the YouTube Tutorials posted on TechWorld with Nana. https://twitter.com/wurstsalat https://docs.google.com/presentation/d/1EL9OYe-1eOPXh6U8SMHnQxs8pcmr01d-uwoWoFnzUaY/edit#slide=id.g5420f4ebeb_0_5 https://www.youtube.com/channel/UCdngmbVKX1Tgre699-XLlUA
29 March 2021 • 52m and 14s
You heard about Continuous Integration, Continuous Delivery and Continuous Deployment. Liquid Software aims to provide the next step towards Trusted Continuous Updates in the DevOps World. In this episode Baruch Sadogursky, DevOps Advocate from JFrog, explains how as engineers we need to add “Updateability” to our non-functional requirements and how product managers and marketing have to forget about traditional releases but think about incremental delivery of value. Baruch (@jbaruch) also promised to send everyone a hard copy of his book “Liquid Software” if you send him a direct message – so – make sure you do that and also check out the details on our discussion of uniquely identifying artifacts through Build-Info. https://www.linkedin.com/in/jbaruch/ https://twitter.com/jbaruch https://drive.google.com/file/d/1PUb67FxM-eTtdyLNGPc-fGTcCJii-keE/view https://github.com/jfrog/build-info
8 March 2021 • 1h, 44s
Software security is about securing websites against malicious attacks or using firewalls to prevent hackers entering your enterprise network. While this is part of software security there is much more that needs to be done – especially as more organizations are developing critical software it is important to protect the whole software delivery lifecycle from any malicious attacks along the supply chain. In this episode we have Michael Plank, Technical Product Manager at Dynatrace, talk about his latest blog post titled How Dynatrace protects its software development and delivery life cycle against supply chain attacks. We learn about attack vectors from development workstation until production deployment. He covers the strategies ranging from static to dynamic code analysis, vulnerability detection or code signatures. Tune in and learn that building secure software is more than ensuring your users have hard to crack passwords! https://www.dynatrace.com/news/blog/how-dynatrace-protects-its-software-development-and-delivery-life-cycle-against-supply-chain-attacks/
22 February 2021 • 58m and 54s
If you are not a gamer you may have never heard about Cyberpunk 2077. If you are – you may know about the challenges during their latest release. Dave Farley (@davefarley77), Co-Author of best seller Continuous Delivery, has been an engineering large and complex systems for decades. His work helped elevate our industry around Continuous Delivery and DevOps. In this episode he shares his learnings from failed projects like Cyberpunk as well as his own latest experiences around that picking the latest technology might be fashionable but is not always the smartest choice. To learn more about Dave check out Continuous Delivery website that also links to his YouTube Channel hosting some of the episodes he was referencing in the podcast. https://twitter.com/davefarley77 https://www.amazon.com/Continuous-Delivery-Deployment-Automation-Addison-Wesley/dp/0321601912#ace-g9859629705 https://www.continuous-delivery.co.uk/ https://www.youtube.com/channel/UCCfqyGl3nq_V0bo64CjZh8g
8 February 2021 • 1h, 5m and 42s
Nobody has foreseen the global pandemic that put a lot of chaos in all our lives recently. Let’s just hope we learn from 2020 to better prepare on what might be next. The same preparation and learning also goes for Chaos in our distributed systems that power our digital lives. And to learn from those stories and better prepare for common resiliency issues we brought back Ana Medina (@ana_m_medina), Chaos Engineer at Gremlin. As a follow up to our previous podcast with Ana, she is now sharing several stories from her chaos engineering engagements across different industries such as finance, eCommerce or travel. Definitely worth listening in as Chaos Engineering was also put into the Top 5 Technologies to look into 2021 by CNCF. https://twitter.com/Ana_M_Medina https://www.spreaker.com/user/pureperformance/why-you-should-look-into-chaos-engineeri https://twitter.com/CloudNativeFdn/status/1329863326428499971
25 January 2021 • 52m and 29s
When moving to microservice architectures its time to re-think continuous delivery. Just as many software services rely on a core data analytics engine to make better automated decisions we need to apply the same for continuous delivery. We can assess the risk of every microservice deployment based on data from production and the desired change of configuration. We can assess the potential blast radius and mitigate it through modern delivery options such as blue/green, canaries or feature flags. Tracy Ragan, Creator & CEO of DeployHub, CDF board member and DevOps Institute Ambassador shares her thoughts on why we need to move to smarter data-driven delivery pipelines. Tracy (@TracyRagan) gives us insights into why not every microservice is created equal and what approaches we can take to better control updates that contain multiple microservice updates. Also make sure to check out their latest project Ortelius and take Tracy up on a virtual coffee chat as discussed in our podcast! https://www.linkedin.com/in/tracy-ragan-oms/ https://twitter.com/TracyRagan https://github.com/ortelius https://go.oncehub.com/15-30MinuteVirtualCoffeeWithTracy
11 January 2021 • 46m and 50s
K8s enables organizations to more easily deploy their containerized solutions as it takes away a lot of the operational tasks which are built-into k8s. This in theory means that you can run your software anywhere and provide it as SaaS offering or deploy it behind corporate firewalls for those customers that demand an on-premise installation. In this episode we have Marc Campbell, Founder and CTO of Replicated, where they help the k8s community to deliver and manage apps on k8s anywhere. For anyone looking into running their apps on k8s you will learn the challenges of Day 1 (delivery, install) and Day 2 (operation, monitoring, troubleshooting) operations. Marc shares common performance and scalability challenges and how to prepare for them during development. In this episode we have Marc Campbell, Founder and CTO of Replicated, where they help the k8s community to deliver and manage apps on k8s anywhere. For anyone looking into running their apps on k8s you will learn the challenges of Day 1 (delivery, install) and Day 2 (operation, monitoring, troubleshooting) operations. Marc shares common performance and scalability challenges and how to prepare for them during development. https://www.linkedin.com/in/campbe79/ https://www.replicated.com/ https://www.heavybit.com/library/podcasts/the-kubelist-podcast/ep-7-keptn-with-andreas-grabner-of-dynatrace/ https://troubleshoot.sh/ https://kots.io/
28 December 2020 • 46m and 16s
Stefan Frandl, Development Director, has a single digit employee number at Dynatrace and therefore seen a lot of agile transformation over the past 15 years – growing from a startup in Linz, Austria to now 800+ engineers across globally distributed labs. A visit to several “unicorns” such as Google, Facebook and Slack triggered the latest agile transformation. In this episode Stefan walks us through the implementation of the changes we discussed with Andrea Holl in her episode on “Scaling Agile at Dynatrace”. He shares the challenges around growing responsibilities of team leads, work left half-finished, overhead on hand-over and cross team collaboration. He then introduces us to the current structure and processes at Dynatrace such as Team Captains, Product Owners and Agile Advocates as well as Dev Directors and Lead Product Engineers. While Dynatrace has seen many benefits already, the journey is still ongoing as Dynatrace is continuously rethinking and improving the way we work and provide value to our customers! https://www.linkedin.com/in/stefan-frandl-aa86723/ https://www.spreaker.com/user/pureperformance/scaling-agile-at-dynatrace-with-andrea-h
14 December 2020 • 45m and 26s
SAFE, LESS or the Spotify Model? Which scaled agile method to apply for your transformation? Or are you unique enough like 44% of organizations based on a European research that are defining their own scaled agile approach to transform successfully? In this episode we sit down with Andrea Holl, Agile Coach at Dynatrace, and let her walk us through the different scaled agile frameworks. She discusses the pros and cons and why many organizations – including Dynatrace – are coming up with their own approaches. For Dynatrace it was about taking the best from the proven frameworks but adapting them to allow us continue or core cultural values such as full autonomy to teams and flexibility of tools and processes. If you are on the brink of a transformation make sure to listen to Andrea and how she and her teams have approached that transformational project! https://www.linkedin.com/in/andrea-elisabeth-holl-b2255a112/ https://www.scaledagileframework.com/ https://less.works/ https://blog.crisp.se/wp-content/uploads/2012/11/SpotifyScaling.pdf
30 November 2020 • 47m and 17s
Daylight savings can bring chaos to systems such as rogue processes consuming CPU or memory and therefore impact your critical systems. The question is: how do you systems react to this chaos? How can you test for this? And how can you make your systems more resilient against this chaos? In this episode we talk with Ana Margarita Medina, Chaos Engineer at Gremlin. In her previous job, Ana (@Ana_M_Medina) was a Site Reliability Engineer at Uber where she helped coping with the “chaos” on New Years Eve or Halloween. Ana gives us great insights into the discipline of Chaos Engineering, that its really about running controlled experiment and that everyone can get started that has an interest in contributing to more resilient systems. Here the additional links we promised during the recording: Drift into failure, Chaos Engineering Community, Chaos Engineering and System Resilience in Practice. https://www.linkedin.com/in/anammedina/ https://twitter.com/Ana_M_Medina https://eng.uber.com/nye/ https://www.amazon.com/Drift-into-Failure-Sidney-Dekker/dp/1409422216 https://www.gremlin.com/community/ https://www.amazon.com/Chaos-Engineering-System-Resiliency-Practice/dp/1492043869
16 November 2020 • 56m and 42s
We are sitting down with Sebastian Scheele (@sscheele), CEO and co-founder of Kubermatic, to discuss the challenges organizations have as they are moving their workloads to k8s and realize that managing, scaling and operating k8s is not getting easier the more k8s clusters you allow your application teams to spin up or down. We learn more about the Kubermatic Kubernetes Platform, the Open Source Project, which centrally manages the global automation of thousands of Kubernetes clusters across multi-cloud, on-prem and edge with unparalleled density and resilience. Thanks Sebastian for answering all the questions we threw at you – questions we have received from many organizations that are moving to k8s but get surprised about the complexity as it comes to properly operating and managing k8s. Sebastian Scheele Twitter https://twitter.com/sscheele Kubermatic Kubernetes Platform https://github.com/kubermatic/kubermatic
2 November 2020 • 55m and 26s
Keptn is now a CNCF sandbox project bringing a new event-driven approach to continuous delivery and operations. While many are just hearing about Keptn the first time, it is interesting to learn more about how it started, which challenges the team ran into, what they learned about K8s, and running an open-source project. We therefore invited Johannes Braeuer (@braeuer_j) and Andreas Grimmer (@grimmer_andreas) – both Keptn project maintainers and contributors – who have been working on the Keptn project since its inception. Especially for groups that want to start open-source projects or are on the brink of deciding pro or con Kubernetes should listen until the end as Johannes and Andreas tell us what they would do differently now if they would start today based on the learnings from the past 18 months. If you want to join the Keptn community, make sure to star our GitHub project, join the Slack channel, and join our regular community meetings! Keptn https://keptn.sh/ Johannes Bräuer on Twitter https://twitter.com/braeuer_j Andreas Grimmer on Twitter https://twitter.com/grimmer_andreas Keptn Github https://github.com/keptn/keptn Keptn Slack https://keptn.slack.com/ Keptn Community https://github.com/keptn/community
19 October 2020 • 1h, 4m and 37s
Getting visibility into .NET code whether it runs on a developer machine, on a windows server on-premise or as a serverless function in the cloud is the day2day job of Georg Schausberger (@BombadilThomas) and Bernhard Ruebl, part of the Dynatrace .NET Agent Team. In this podcast we hear firsthand about the challenges in bringing observability, monitoring and distributed tracing to the .NET ecosystem. They give us insights about their continued effort to reduce startup and runtime overhead, the innovation that comes out of Microsoft as they are moving towards open standards and the noble automated approach to always validated things don’t break monitored code with the constant update of libraries and frameworks. We also got both to talk about their developer experience when working with commercial tools such as Dynatrace and its PurePath technology as well as open source tools when analyzing and debugging their own code or helping users figure out what’s wrong with their code. In the talk both mentioned other tools which we wanted to provide the links for: Benchmark.NET https://benchmarkdotnet.org/articles/overview.html Ben.Demystifier. https://www.nuget.org/packages/Ben.Demystifier/ IIS Module tracing https://forums.ivanti.com/s/article/How-To-Enable-IIS-Failed-Request-Tracing Georg Schausberger https://twitter.com/BombadilThomas https://www.linkedin.com/in/georg-schausberger-6898b6141/ Bernhard Rübl https://www.linkedin.com/in/bernhard-r%C3%BCbl-084881104/
5 October 2020 • 1h, 41s
Successful Cloud Migrations, large scale Kubernetes & OpenShift deployments, making billions of data points actionable and enterprise-wide Citrix & SAP monitoring. These are some of the projects Kayan Hales, Technical Manager at Dynatrace, and her colleagues at Dynatrace ONE help enterprise customers around the world to implement every day. We sat down with Kayan as we wanted to learn what really matters to many large organizations as they embark on automating monitoring into their hybrid multi-cloud environments. While we constantly talk about cloud native and microservices it was interesting to hear what the global team of Dynatrace experts is doing on a day-2-day basis. Kayan gives us insights how important it is to think about meta data, tagging strategies and automation before large scale rollouts and that one of the first question you need to ask is: who needs what type of data at which time through which channels. https://www.linkedin.com/in/kayanhales/ https://www.dynatrace.com/services-support/dynatrace-one/
21 September 2020 • 57m and 10s
Why do some organizations still see performance testing as a waste of time? Why are we not demanding the same level of performance criteria for SaaS-based solutions as we do for in-house hosted services? Why are many organizations just validating performance to be “within specification” vs “holistically optimized”? In this episode we have invited James Pulley (@perfpulley), Performance Veteran and PerfBytes News of the Damned host, to discuss who organizations can level up from performance testing to true performance engineering. He also shares his approaches to analyzing performance issues and gives everyone advice on what to do to start a performance practice in your organization. https://www.linkedin.com/in/jameslpulley3/ https://www.perfbytes.com/p/news-of-damned.html
7 September 2020 • 1h, 8m and 16s
Imagine a future where we deploy every code change directly into production because feature flags eliminated the need for staging. Feature flags allow us to deploy any code change, but only launch the feature to a specific set of users that we want to expose to new capabilities. Monitoring the usage and the impact enables continuous experimentation: optimizing what is not perfect yet and throw away features (technical debt) that nobody really cares about. So – what are feature flags? We got to chat with Heidi Waterhouse (@wiredferret), Developer Advocate at LaunchDarkly (https://launchdarkly.com/), who gives as a great introduction on Feature Flags, how organizations actually define a feature and why it is paramount to differentiate between Deploy and Launch. We learn how to test feature flags, what options we have to enable features for a certain group of users and how important it is to always include monitoring. IF you want to learn more about feature flags check out http://featureflags.io/. If you want to learn more about Heidi’s passion check out https://heidiwaterhouse.com/.
24 August 2020 • 45m and 22s
Adrian Hornsby (@adhorn) has dedicated his last years helping enterprises around the world to build resilient systems. He wrote a great blog series titled “Patterns for Resilient Architectures” and has given numerous talks about this such as Resiliency and Availability Design Patterns for the Cloud at DevOne in Linz earlier this year. Listen in and learn more about why resiliency starts with humans, why we need to version everything we do, why default timeouts have to be flagged, how to deal with retries and backoffs and why every distributed architect has to start designing systems that provide different service levels depending on the overall system health state. Links: Adrian on Twitter: https://twitter.com/adhorn Medium Blog Post: https://medium.com/@adhorn/patterns-for-resilient-architecture-part-1-d3b60cd8d2b6 Adrian's DevOne talk: https://www.youtube.com/watch?v=mLg13UmEXlw DevOne Intro video: https://www.youtube.com/watch?v=MXXTyTc3SPU
3 August 2020 • 55m and 31s
Whether you are still researching on whether you need a Service Mesh or simple use a load balancer or if you are already deploying multi hybrid-cloud architectures and Service Meshes help you secure the location aware routed traffic. In both cases: listen to this episode! We invited Sebastian Weigand (@ThatDevopsGuy) back to our podcast who wrote papers such as Building a Planet-Scale Architecture the Easy Way. In our episode Sebastian walks us through why Service Meshes have gained so much in popularity, what the main use cases are, how you should decide on whether or not use Service Meshes and which challenges you might run into as you expand into using more features. https://twitter.com/thatdevopsguy https://files.devnetwork.cloud/DeveloperWeekNewYork/presentations/2019/scalability/Sebastian_Weigand.pdf
20 July 2020 • 1h, 3m and 22s
Steve McGhee (@stevemcghee) is an expert in post mortems and SRE. He has learned the craft at Google, applied it at MindBody and is now sharing his experiences while back at Google to the larger SRE community. Listen to this episode and learn more about how post mortem analysis can be the starting point of your SRE transformation. How it can help reliability engineering to build and engineer systems that fail gracefully instead of causing full crashes or outages. Steve also went into monitor what matters and only defining alerts on leading indicators with an expiration date – a fascinating concept to avoid a flood of custom alerting in production! If you want to learn more from Steve or SRE check out these additional resources he mentioned in the podcast: The SRE I aspire to be (SRECon19) and his 2 blog part series on blameless.com. https://twitter.com/stevemcghee https://www.youtube.com/watch?v=K7kD_JfRUY0 https://www.blameless.com/blog/improve-postmortem-with-sre-steve-mcghee
6 July 2020 • 1h, 7m and 14s
Keep hearing the terms SLIs, SLOs, SLAs, Error Budgets and finally want to understand what they are, who should be responsible for and how they fit into SRE (Site Reliability Engineering)? Then listen to our conversation with Sebastian Weigand who has been helping organizations modernizing not only their application stacks but also helping them embrace DevOps & SRE. Learn about who is responsible to define SLIs, what the difference between SLOs and SLAs are and what the difference between DevOps & SRE is in his opinion! Sebastian, who calls himself “That Devops Guy” (@ThatDevopsGuy), also suggests to check out the latest free report on SLO Adoption and Usage of SRE as well as SRE Books from Google to get started with that practice. https://www.linkedin.com/in/thatdevopsguy/ https://twitter.com/ThatDevopsGuy https://landing.google.com/sre/resources/practicesandprocesses/slo-adoption-and-usage-in-sre/ https://landing.google.com/sre/books/
22 June 2020 • 1h, 2m and 3s
8 June 2020 • 51m and 17s
How do you prepare for a 2Mio concurrent user load that lasts for 7 seconds? What does the load infrastructure look like? How do you optimize your scripts? How do you deal with DNS or CDNs? In this episode we hear from Joerek van Gaalen who has done these types of tests. He shares his experiences and approaches to running these “special event extreme load tests”. If you want to learn more make sure to check out his presentation and read his blog post from Neotys PAC 2020. https://www.linkedin.com/in/joerekvangaalen/ https://www.neotys.com/performance-advisory-council/joerek_van_gaalen
25 May 2020 • 58m and 14s
Have you ever burned 30k because you forgot to turn off your test VMs over the weekend? Have you ever accidentally deleted “the production table” because you thought you were connected to your dev database? We often only hear the good stories and not those that teach us about what we should not do in order to avoid disaster! Join this episode where Justin Donohoo, Founder and CTO of Observian, tells us horror stories from his professional life that taught him great lessons on what not to do when moving to the cloud, re-architecture because of exponential growth or let the intern do things he/she shouldn’t do. https://www.linkedin.com/in/jdonohoo/
11 May 2020 • 1h, 3m and 46s
Goranka Bjedov has seen the different sides of Open Source while she was working for organizations such as Google, Facebook or AT&T Labs. Before she takes the stage at www.devone.at later this year she gives us her take on Scott McNealy’s quote “Open Source is free like a puppy is free”. Tune in and hear her thoughts on how to pick the right tools, languages or frameworks, how to grow a an open source project and what things you should definitely avoid. https://www.linkedin.com/in/goranka-bjedov-5969a6/ https://devone.at/
27 April 2020 • 57m and 56s
Starting your new job as Infrastructure Engineer in a large bank with your to-be boss and his key architects just leaving feels like Chaos! Maybe that’s why Tammy Butow has made a career in Chaos and Site Reliability Engineering. In this episode, Tammy shares her experiences of bring reliability into highly complex systems at NAB, Digital Ocean, DropBox or now Gremlin through chaos engineering. You learn about the importance to know and baseline your metrics, to define your SLIs and SLOs and to continuously run your fire drills to ensure your system is as reliable as it has to be. If you want to learn more check out Tammy’s presentations on speakerdeck and make sure to join the chaosengineering slack channel. https://www.linkedin.com/in/tammybutow/ https://speakerdeck.com/tammybutow https://slofile.com/slack/chaosengineering
13 April 2020 • 48m and 29s
3 years ago, Adam Auerbach explained how he helped Capital One to automate performance into the DevOps Delivery Pipeline. In 2020, where IT Security is a hot trending topic, Adam works for EPAM and is back advocating for the same shift-left he as advocated for when it comes to functional or performance testing. But now – its about baking Security into your practices & culture. And he has a cool word for it: DevTestSecOps! Listen in and learn which types of security checks can be fully automated in the different stages of the delivery pipeline. Also learn how to prioritize your vulnerabilities as you most likely end up with a lot of noise in the beginning. Adam also highlightes the following open source tools that will help in that transformation: getcarrier.io and reportportal.io. https://www.linkedin.com/in/adamauerbach/ https://getcarrier.io/#about https://reportportal.io/
30 March 2020 • 56m and 46s
DesignOps, just as DevOps or NoOps, is targeted towards increasing the efficiency and collaboration between designers and engineers in order to deliver better, intuitive and consistent user experiences. It requires changes in processes, people and tooling and is heavily driven by enabling engineers to become more autonomous when developing and delivering new value for their organization. Join this podcast and learn from Ursula Wieshofer (@Ursula_W), UX Design Team Lead, as well as from Fabian Friedl (@fabian_friedl), DesignOps Team Lead, on how we live and breath DesignOps at Dynatrace. You will learn about the recently released OpenSource Design System Barista, how it enables our engineers to deliver consistent user experiences across all sorts of software projects and how we manage feedback through the Barista GitHub project for future innovation. Also make sure to check out the recent blog posts UX Guilds as well as how we deal with the constantly changing requirements of our design teams. https://twitter.com/Ursula_W https://twitter.com/fabian_friedl https://barista.dynatrace.com/ https://github.com/dynatrace-oss/barista https://www.dynatrace.com/news/blog/a-running-start-towards-enablement-how-a-ux-guild-will-broaden-our-horizons/ https://medium.com/@holgua/owning-the-unknown-d796e2705ac
16 March 2020 • 29m and 52s
We're back to our regular scheduled show! Kelsey Hightower (@kelseyhightower) has worn many hats as it says on his bio but we also learned from him that he probably doesn’t have that many hats at home as he has been living a minimalistic life over the past couple of years. A philosophy as we learn in this podcast that also goes well when it comes to building your next platform on Kubernetes. In this podcast we learn about the do’s and don’ts, how you should plan and test for k8s upgrades, which tradeoffs you have to take as it comes to performance, how to think about developer productivity on k8s and why it is important to read up on security as it relates to the software we build, deploy and run on our k8s clusters. Thank you Kelsey for supporting our community with your time and expertise. Hope to have you back in the future! https://twitter.com/kelseyhightower
2 March 2020 • 59m and 37s
We wrap up the last day of the Dynatrace Perform 2020 conference with an announcement of winners and learnings for the week
6 February 2020 • 13m and 42s
Vamos terminando el evento teniendo una platica con nuestro amigo Cesar Quintana quien tambien dio una vuelta por nuestro stand a platicar un poco de su experiencia en esta gran cxonferencia.
6 February 2020 • 13m and 47s
Andi Grabner, our man-on-the-street, gets the scoop on: -NoOps: Reaching zero-incident prod through auto-remediation-as-code with Juergen Etzlstorfer -Beyond thresholds – Find anomalies and reduce false positives with Thomas Natschlaeger -Hybrid observability, from enterprise cloud to mainframe and everything in between with Alex Huetter
6 February 2020 • 20m and 45s
6 February 2020 • 36m and 14s
Andi Grabner, our man-on-the-street, gets the scoop on: -Observability and beyond - with Thomas Rothschädl -Automated, AI-powered answers for Kubernetes with Matt Reider -RUM Roadmap with Alexander Sommer
6 February 2020 • 23m and 31s
6 February 2020 • 30m and 28s
Travis Depuy of xMatters speaks to Leandro & Brian about how to leverage tools like xMatters to enable proactive alerting and remediation throughout your code life-cycle
6 February 2020 • 24m and 39s
Leveraging APM to enable smarter cloud migration
6 February 2020 • 18m and 11s
6 February 2020 • 11m and 40s
Andi Grabner, our man-on-the-street, gets the scoop on: -Going web-scale with cross-environment features, globally distributed high availability and more - with Guido Deinhammer -The role of OpenTelemetry in Dynatrace with Daniel Khan and Sonja Chevre -Build resiliency into your continuous delivery pipeline with Michael Villiger
6 February 2020 • 25m and 40s
Tenemos tambien a Gabriel Prioli platicandonos un poco de la conferencua y su experiencia ayudando con la herramienta.
6 February 2020 • 32m and 6s
Nos encontramos con el amigo Sergio que nos platica de las novedades que tinene Dynatrace y ia eperiencia que se vive en la confrencia.
6 February 2020 • 44m and 48s
Andi Grabner, our man-on-the-street, gets the scoop on: -Leverage AIOps with Dynatrace with Wolfgang Beer -Load & performance engineering as a self-service with Rob Jahn -The right way to deploy canary, blue/green and feature flags with Safia Habib
5 February 2020 • 22m and 46s
Tenemos la oportunidad de platicar con Andres Suarez venido desde Colombia participando en la conferencia, quien nos cuenta de su experiencia en este evento asi como de sus aventuras pasadas.
5 February 2020 • 37m and 4s
5 February 2020 • 38m and 44s
Andi Grabner, our man-on-the-street, gets the scoop on: -How to improve every user’s mobile experience - with Dominik Punz -Advanced observability in cloud native microservices and service meshes with Alois Mayr and Sonja Chevre -Monitoring-as-a-self-service with Kristof Renders
5 February 2020 • 7m and 49s
5 February 2020 • 25m and 16s
5 February 2020 • 26m and 51s
We catch up with Mark on his latest adventures at Barbri, what he has planned next and how he gets business answers from Dynatrace using Digital Business Analytics https://www.dynatrace.com/perform-vegas/
5 February 2020 • 29m and 54s
Here we are once again at the Cosmopolitan Hotel and Casino in Las Vegas, NV where…ONCE AGAIN…we are starting our 3 day marathon from the Dynatrace PERFORM 2020 conference. This is one of the biggest high-tech conference focused on performance disicplines from testing, engineering, architecture, monitoring and system scalability using Dynatrace’s innovative solutions. We’ll be chatting with Dynatracers, discussing news topics, giving away shoes and mini drones and sharing conversations with so many of the partners and attendees at this wonderous conference.
5 February 2020 • 34m and 12s
Do you have a clear definition of what Reliability means for your organization? Abigail Wilson, Reliability Architect at CFA Institute, sees this as a key requirement before you start transforming your organization to embrace site reliability, DevOps or Cloud Native. In the podcast we hear how Abigail went on her journey where she has proven that you don’t need a background in IT in order to become an advocate and change agent for reliability engineering. In her role has bridged the gap between business and IT, she has helped bring stable environments to developers and testers and with those and many other steps has increased overall productivity, quality and stability of their business critical applications. https://www.linkedin.com/in/abigailswilson/ https://theabigailwilson.com/
6 January 2020 • 58m and 48s
What does the Dynatrace ACE (Autonomous Cloud Enable) Team work on these days? How do cloud platform owners implement Monitoring as a Self-Service? How to elevate from traditional performance engineering to Performance as a Self-Service? How to bring the Unbreakable Delivery Pipeline to life? What problems can be auto-remediated and how? And what’s the role of Keptn when it comes to boosting the path to Autonomous Cloud? In this episode we have Andi, track leader of Perform 2020’s Release Better Software Faster, talk us through what he is expecting to learn and take away from each breakout session. For more details check out his blog post or sign yourself up for a ticket to Perform 2020. https://www.dynatrace.com/news/blog/getting-ready-a-taste-of-whats-to-come-at-perform-2020s-release-better-software-faster-track/ https://www.dynatrace.com/perform-vegas/
23 December 2019 • 37m and 50s
If you lift & shift to the cloud or move things back from the cloud to on-premise you most likely didn’t understand cloud and how it can help you transform your business and organization. A bold statement but very much true so as we learn in our conversation with Mike Kavis, Chief Cloud Architect at Deloitte. Mike (@madgreek65) has been in technology for 35+ years and was an early adopter of cloud technology as early as 2007 when AWS only had about 6 APIs. He has launched several startups and is now helping organizations to rethink what cloud means for them! Listen in to this podcast and learn why organizations fail if they don’t understand that cloud is about agility, it’s a platform for innovators and that traditional IT teams have to transform into internal cloud providers that provide services to their development teams for them to deliver better business value faster. Mike also runs his own podcast – you may want to listen in to the episode he recorded with Andi on Speeding up Digital Transformation with Cloud Native. https://www.linkedin.com/in/mikekavis/ https://twitter.com/madgreek65 https://www2.deloitte.com/us/en/pages/consulting/articles/digital-transformation-requires-a-cloud-native-mindset-cloud-computing-cloud-value-devops-noops-aiops-software-development-business-value.html
9 December 2019 • 45m and 23s
Wait! What? This is our 100th Episode of PurePerformance? For this special anniversary we invited Mark Tomlinson, Performacologist & “The Performance Sherpa”, who also inspired us through his PerfBytes Podcast to run our own PurePerformance Podcast. While we start with talking about performance in podcasting we move over to learning more about how Mark is establishing a Continuous Performance process at his current employer. We learn about new ways to do performance engineering in a continuous way, how to integrate it with your monitoring and why it is not always important to run the big load tests but rather focus on short feedback cycles. We want to give Mark credit for what he has done for the performance community and use this to say THANK YOU!! Hope to have you back for many more episodes to come and definitely for episode 200! https://www.linkedin.com/in/perfsherpa/ https://www.perfbytes.com/
25 November 2019 • 1h, 7m and
In 2013 the Phoenix Project by Gene Kim, Kevin Bahr and George Spafford sparked the next phase of DevOps transformations. 6 years later Gene Kim (@RealGeneKim) is back with The Unicorn Project, A Novel about Developers, Digital Disruption, and Thriving in the Age of Data. Developer Productivity is a key focus point of the story in the book and is what Gene has learned from different companies in the last years about. Good engineering companies put their best resources in developer productivity as it benefits every developer and allows them to use their best energy to provide business value instead of solving puzzles. Gene lets us in on his day at Etsy as well as the story from Nokia and the reason they moved away from Symbian – both stories that touch on developer productivity! If you want to learn more and read about the Five Ideals then download the excerpts from The Unicorn Project. https://itrevolution.com/the-unicorn-project/ https://twitter.com/RealGeneKim
11 November 2019 • 52m and 18s
ChatOps is not new! But many organizations have not understood nor leverage its full potential. The use cases spread from “What’s on todays cafeteria menu?” to “Deploy my latest Git commit as canary and scale based my SLOs!”. Listen to this podcast and learn from Nestor Zapata and Zohaib Hassan – both working at Citrix – on how they have started their ChatOps journey, how the built trust in the technology and how it helped them transform their organization towards more autonomy thanks to the self-service model enabled through the chat bots they developed. We discuss many of their self-service use cases such as Performance as a Self-Service or even Self-Healing which they implemented through Chat Bots integrated with Slack, Dynatrace, ServiceNow, Jira and other tools. If you want to see their chat ops in action watch our Performance Clinic on Automate Deployment and Site Reliability with Bots, ChatOps and Dynatrace https://www.youtube.com/watch?v=xE_LMQ9u7l4
28 October 2019 • 55m and 51s
16 years and still growing! Not every open source project has the track history of Spring (www.spring.io), a framework for building modern applications for the java runtime. Juergen Hoeller (@springjuergen), creator of the Spring framework, gives us insights into how he and his team have grown Spring to where it is now. We learn how they have built a developer community, how they deal with feedback, why its important to interact with your users on a regular basis and where the road is heading. Juergen also shares insights on topics such as scalability, performance, concurrency of the framework as well as how the rise of new java runtimes and distributions still keeps him excited about the future of Spring. If you want to learn more visit the Spring Framework’s GitHub project and make sure to read up on the latest blogs on https://spring.io/blog/ https://spring.io/ https://twitter.com/springjuergen https://github.com/spring-projects/spring-framework/
14 October 2019 • 1h, 6m and 46s
Are you analyzing the dependency between change frequency, technical complexity, and growth and length of code change hotspots? You should as it helps you with tackling technical debt and risk assessment the right way! In this podcast Adam Tornhill (@AdamTornhill) explains how he is applying data science and forensic approaches on data we all have in our organization such as: GIT commit history, ticket stats, static & dynamic code analysis, monitoring data … He is giving us insights in detecting code hotspots and how we can use leverage this data in areas such as risk assessment, the social side of code changes as well as explaining to business why working on technical debt is going to improve time to market. Also make sure to check out CodeScene: A powerful visualization tool that uses Predictive Analytics to find social patterns and hidden risks in your code Adam Tornhill on Twitter https://twitter.com/AdamTornhill Adam Tornhill's blog https://www.adamtornhill.com/ CodeScene https://www.empear.com
30 September 2019 • 52m and 11s
Did you know that Distributed Tracing has been around for much longer than the recent buzz? Do you know the history and future of OpenCensus, OpenTracing, OpenTelemetry and TraceContext? Listen to this podcast where we chat with Sonja Chevre, Technical Product Manager at Dynatrace, and Daniel Khan, Technical Evangelist at Dynatrace, about the past, current and future state of distributed tracing as a standard. OpenTelemetry https://opentelemetry.io/ OpenCensus https://opencensus.io/ OpenTracing https://opentracing.io/ TraceContext https://www.dynatrace.com/news/blog/distributed-tracing-with-w3c-trace-context-for-improved-end-to-end-visibility-eap/
16 September 2019 • 34m and 9s
In 2018 Adrian Cockcroft was quoted with: “Chaos Engineering is an experiment to ensure that the impact of failures is mitigated”! In 2019 we sit down with one of his colleagues, Adrian Hornsby (@adhorn), who has been working in the field of building resilient systems over the past years and who is now helping companies to embed chaos engineering into their development culture. Make sure to read Adrian’s chaos engineering blog and then listen in and learn about the 5 phases of chaos engineering: Steady State, Hypothesis, Run Experiment, Verify, Improve. Also learn why chaos engineering is not limited to infrastructure or software but can also be applied to humans. Adrian on Twitter: https://twitter.com/adhorn Adrian's Blog: https://medium.com/@adhorn/chaos-engineering-ab0cc9fbd12a
2 September 2019 • 55m and 6s
19 August 2019 • 56m and 39s
Susanne Kaiser (@suksr) has transformed her company from monolith on-premise into a SaaS solution running on a microservice architecture: Successfully! Nowadays she consults companies that need to find their “core domain”, break up and re-fit their architectures and organizational structure in order to truly get the benefit of microservices. In this podcast you learn which questions you need to ask before starting a microservice project, how to find your true “core domain”, how to restructure not only your code but also organization and you get exposed to the concept of Wardley Maps which help you decide what to build vs what to outsource in order to deliver value to your end users the most efficient way. Links: Susannne on Twitter - https://twitter.com/suksr DevOne Conference Page - https://devexperience.ro/speakers/susanne-kaiser/ Wardley Maps Microservices presentation - https://www.slideshare.net/SusanneKaiser3/preparing-for-a-future-microservices-journey-with-wardley-maps
5 August 2019 • 1h, 18s
To service mash or not? That’s a good question! Not every architecture and project needs a service mesh but for running distributed microservices architectures service mashes provide a lot of essential features such as service discovery, traffic routing, security, observability .. We invited Matt Turner (@mt165), CTO at Native Wave, to tell us all we need to know about service mashes. We get a deep dive into Istio, one of the most popular current service mashes, the architecture and how the individual components such as Envoy, Pilot, Mixer and Citadel work together. We also chat about the tradeoff between performance, latency, throughput and service mash capabilities. If you want to learn more make sure to check out Matt’s online content such as blogs and recorded conference presentations on https://mt165.co.uk/. Native Wave https://nativewave.io/ Istio vs. Linkerd CPU Overhead Benchmarks by Michael Kipper Initial Observations: https://medium.com/@michael_87395/benchmarking-istio-linkerd-cpu-c36287e32781 Second Analysis: https://medium.com/@michael_87395/benchmarking-istio-linkerd-cpu-at-scale-5f2cfc97c7fa
22 July 2019 • 46m and 8s
Keptn (@keptnProject) is an open source control plane for Kubernetes enabling continuous delivery and automated operations. In this session we chat with Dirk Wallerstorfer (@wall_dirk) who is leading the keptn development team. We learn from Dirk why they choose knative as serverless framework to let keptn connect to other DevOps tools in the toolchain, how the event driven architecture works, which use cases are supported and where the road is heading. If you are interested also check out our Getting Started with keptn YouTube Tutorial, join the keptn slack channel, keep an eye at the keptn community and give feedback after trying out keptn yourself by following the following installation instructions: https://keptn.sh/docs/ Links: keptn on Twitter - https://twitter.com/keptnProject Dirk on Twitter - https://twitter.com/wall_dirk knative - https://cloud.google.com/knative/ Keptn Video - https://www.youtube.com/watch?v=0vXURzikTac Keptn Slack - https://keptn.slack.com/join/shared_invite/enQtNTUxMTQ1MzgzMzUxLTcxMzE0OWU1YzU5YjY3NjFhYTJlZTNjOTZjY2EwYzQyYWRkZThhY2I3ZDMzN2MzOThkZjIzOTdhOGViMDNiMzI Keptn Community - https://github.com/keptn/community Keptn Docs - https://keptn.sh/docs/
8 July 2019 • 51m and 26s
Can you explain Cloud Native? What are the key OpenSource frameworks you need to know? How about all these OpenSource Licensing models? Why do they exist? Which one to use? What are the monetization models and why to watch closely how Big IT & Cloud companies are impacting this space? Carmen Andoh (@carmatrocity), Program Manager at Google and former Infrastructure Engineer at Travis CI, helps us understand how to navigate the Cloud Native & OpenSource world and gives answer to all the questions above. The IT world is changing but its up to us to shape the future by inventing it. If you want to learn more after listening check out the CNCF Trailmap and follow up with Carmen on social media to get access to her material around that topic! Trailmap https://github.com/cncf/trailmap
24 June 2019 • 56m and 40s
10 June 2019 • 44m and 59s
Self-Healing, Auto-Remediation: Magic words for most IT Leaders! When starting those kinds of projects teams realize their lack of maturity or even understanding of their current IT landscape to even think about Self-Healing. In other scenarios Self-Healing is misunderstood as a band-aid for “keeping the lights on” in order to buy more time for outstanding product improvements vs investing in the core architecture. In this podcast we invited Jon Hathaway, CEO of HATech, and Jarvis Mishler, Solutions Architect Team Lead at HATech (@hatechllc), to learn about how they help organizations assess and improve the maturity of their IT Systems & processes, which auto-remediation actions they typically implement and why real self-healing is not just about keeping the lights on! https://www.linkedin.com/in/jonhathaway/ https://hatech.io/ https://www.linkedin.com/in/jarvis-mishler/ https://twitter.com/hatechllc
27 May 2019 • 43m and 50s
Did you know that the JVM has 700+ configuration settings? Did you know that MongoDB performance can be improved by 50% just by tuning the right database and OS nobs? Every thought that slower I/O can actually speed up database transaction times? In this episode we invited Stefano Doni, CTO at Amakas.io, who gives us a new perspective on how to approach performance optimization for complex environments. Instead of manually tweaking nobs on all sorts of runtimes or services they developed a Goal-driven AI-engine that automatically identifies the optimal settings for any application as it is under load. Make sure to check out their website and white papers where they go into details about how their algorithms work, which metrics they optimize and how you can apply their technology into a continuous delivery process https://www.linkedin.com/in/stefanodoni/ https://www.akamas.io/
13 May 2019 • 54m and 31s
Have you ever used USE? Have you ever wondered what differentiates a performance tester from a performance engineer? Want to know how to automate performance engineering into DevOps Pipelines? Twan Koot, Performance Engineer at Sogeti, is answering all these questions. We met him at the last Neotys PAC Event where he gave an in-depth look on metrics and enlightened us all with USE (a method from Netflix’s Brendan Gregg). In our conversation we explain what USE really is, how to apply it and how a good performance engineer needs to understand more than just response time! Links: Twan on linkedin - https://www.linkedin.com/in/twan-koot-a813a8b7/ Twan's deck from Neotys PAC - https://www.neotys.com/performance-advisory-council/twan-koot Twans Video at Neotys PAC - https://www.youtube.com/watch?v=hV8wpkDUtys http://www.brendangregg.com/ Brendan Gregg's home page - http://www.brendangregg.com/ eBPF - https://prototype-kernel.readthedocs.io/en/latest/bpf/ BCC - https://iovisor.github.io/bcc/
29 April 2019 • 43m and 11s
How many different continuous delivery pipelines do you have in your organization? Do you have dedicated teams that keep them up-to-date and constantly extend them with new tool integrations? Have you already built in capabilities for shadow, dark, blue/green or canary deployments? Is auto-mitigation and self-healing already on your internal pipeline roadmap? Sounds like a lot of manual work? Keptn (@keptnProject)– an open source enterprise-grade framework for shipping and running cloud-native applications – is going to eliminate the manual efforts in building, maintaining and extending pipelines. Alois Reitbauer, Head of the Dynatrace Innovation Lab, gives us the background on how keptn evolved, which cloud native best practices are implemented as core capabilities, how to contribute to this project and gives us a glimpse into where the journey is going. Visit the about page and join the community and make sure to deploy keptn on your own Kubernetes clusters by simply following the step-by-step guides. https://keptn.sh/ https://twitter.com/keptnProject https://keptn.sh/about/ https://keptn.sh/docs/
15 April 2019 • 49m and 47s
Nicki (@nicki_23) was bored in finance, started to learn .NET development on the side and eventually won 250k at a hackathon she used for her startup. Now she is a “Digital” Technical Evangelist at AWS and spreads her passion about Serverless through twitch and shares her code examples on github Tune in if you want to learn more about which things you should know about Serverless and Lambda. We chat about IAM permissions, timeouts, API Gateway and how a CI/CD Pipeline for Lambdas should look like https://twitter.com/nicki_23?lang=en https://www.twitch.tv/aws https://github.com/kneekey23
1 April 2019 • 55m and 13s
Is Cloud Native just a synonym for Kubernetes? How to make sense of the sea of tools & frameworks that pop up daily? What can we learn from others that made the transformation and most of all: Where do we start? We got answer to all these and many more questions from Priyanka Sharma (@pritianka) – Dir. of Alliances at GitLab and Governing Board Member at CNCF (Cloud Native Computing Foundation). In her work, Priyanka has seen everything from small startups to large enterprises leveraging Cloud Native technology, tools and mindset to build, deploy & run better software faster. She advises to start incrementally and whatever you do in your transformation make sure to always focus on: Visibility (which leads to transparency), Easy of Collaboration (which increases productivity & creativity) and Setting Guardrails (this ensures you stay compliant & avoids common pitfalls). We ended the conversation around the idea of needing “Cloud Native Aware Developers” which can follow best practices or standards such as those promoted by CNCF or OpenSource projects such as keptn.sh https://twitter.com/pritianka https://www.cncf.io/ https://keptn.sh/
18 March 2019 • 50m and 31s
The .NET Runtime – whether .NET Framework or .NET Core – provides many ways to optimize memory management. But they don’t come in the form of configuration switches as we know if from Java. While there are a handful of settings, the .NET Runtime favors a different approach: asking developers to write memory aware software that follows a couple of core memory aware principles and best practices. In this podcast we get to talk with Konrad Kokosa (@konradkokosa) – author of Pro .NET Memory Management. In his book he gives developers and operators great tips on how to optimize your .net applications and environments such as #1: start with proper monitoring; #2: reduce memory allocations; #3: well – for this and more you should check out Konrad’s book. Listen in to a great discussion with somebody that has been working very close with the .NET Engineering Teams over the past years and brings the internal secrets of .NET Memory Management to everyone out there that wants to write Memory Aware .NET Software! https://prodotnetmemory.com/ https://twitter.com/konradkokosa
4 March 2019 • 49m and 2s
Creating and maintaining test scenarios not only takes a lot of time, but means we are creating artificial test scenarios based on what we think users are going to do versus replicating real users behavior. In this episode we invited Thomas Rotté, one of our friends from https://probit.cloud, who solved these problems for their work at KBC Bank. Their solution is an AI that learns behavior of real user traffic, creates a probability model for most common user journeys and uses that model to create automation test scripts on the fly for automated, real user simulating test bots. We also learn how GDPR and other challenges influenced their solution and how they are now working with other tool vendors and enterprises to bring this technology to the market.
18 February 2019 • 45m and 42s
How to you scale a startup, a mid size company or an enterprise software organization? Can we learn from the Spartans or the Romans? And how can we explain DevOps to a Dummy? In this fun filled episode with Emily Freeman (@editingemily), Cloud Developer Advocate at Microsoft, we get answers to all these questions and get inspired to join Emily’s appearance at the upcoming devone.at conference in Linz, Austria where she dives deeper into how to successfully scale development organizations from startup to enterprise. Later in 2019 make sure to watch out for the written version of our discussion on DevOps for Dummies – Emily is using her writing skills to bring it to paper! https://emilyfreeman.io/
4 February 2019 • 49m and 50s
Kurt Aigner gave a session about managing hybrid system complexity, from the cloud to the mainframe and everything in between. He shares a few notes and tips in this discussion.
31 January 2019 • 10m and 31s
31 January 2019 • 9m and 32s
In this episode, Andi has a coffee and a chat with Dynatrace's Chief Product Officer Andreas Lehofer where they dig a little deeper into AI Ops Enhancements.
31 January 2019 • 9m and 58s
Gary Carr shares his experience with Dynatrace IaaS cloud support and a deep dive into how Dynatrace open AI provides intelligence into the capabilities and technologies of Azure, GCP and AWS.
30 January 2019 • 23m and 36s
Jimmy Stewart of Kroger, along with Michael Timmers and Kamala Dasika from Pivotal Cloud Foundry, discuss Kroger’s migration to PCF and how they tackle monitoring with the Dynatrace Bosh Agent
30 January 2019 • 18m and 16s
Long-time Dynatracer Nestor Zapata chats with us about Citrix’s fundamental shift from reactive to proactive and predictive operations; moving from data sets and charts to AI-powered answers. His session detailed advantages of a “Gen 3” monitoring approach and how to get there.
30 January 2019 • 23m and 58s
Ryan Murphy shares his experiences with leveraging Dynatrace to help deliver value to customers and partners, especially focusing on Cloud provisioning and multi-environment configuration management.
30 January 2019 • 14m and 44s
JP Morgenthal, CTO of application services at DXC Technologies talks to us about all the things devops you were afraid to ask.
30 January 2019 • 29m and 21s
We take a deep look at the new Session Replay news with Senior Director of Product Management Simon Scheurer
30 January 2019 • 15m and 49s
Steven Marrocco shares his experience with automated monitoring and management using Dynatrace for virtualized environments leveraging Blue Prism for problem-solving, collaboration and knowledge insight algorithms. No actual robots were harmed in this recording.
30 January 2019 • 26m and 21s
E.G. Nadhan, Chief Technology Strategist at Red Hat, talks to us about leveraging open source innovation to maximize performance. https://twitter.com/NadhanEG
30 January 2019 • 18m and 38s
Conversation with Ben Rushlo, VP of Services at Dynatrace, talks to us about dashboarding in Dynatrace
30 January 2019 • 21m and 37s
Wolfgang Beer talks about how to make the most of our latest innovations and enable you to automate and manage your operations at web-scale. He shares insights from his session on management zones, API integrations, deployment automation, and best practices using our open AI.
30 January 2019 • 8m and 57s
In this episode Dynatrace's Michael Villiger shares some tips about his work with Humana to chose PCF as a key platform to power their digital transformation.He reviews the culture change and new methodology that helped to exposed gaps in existing monitoring practices and tools. And how Humana's strategy for the future using public cloud deployment, continuous monitoring with DevOps, and monitoring tool consolidation.
30 January 2019 • 9m and 39s
Sonja Chevre reviews her session on Dynatrace enables Performance Engineering as a Self-Service. She chats about how you can integrate performance testing tools with Dynatrace and how to embed performance diagnostics into the development work-flow for faster automated feedback.
30 January 2019 • 10m and 26s
Carmen Puccio, Principal Solutions Architect at AWS, talks to us about the Dynatrace Managed AWS quick-start program we created with him, modeling your monolith as a microservices platform, some of the Dynatrace announcements from Perform as well as Carmen’s snow adventure with guest of the show Mandus Momberg
30 January 2019 • 14m and 37s
Mark, Jame and Brian talk about today's announcements.
30 January 2019 • 26m and 55s
James and Mark will just chat about everything today, and some NOTD stories and Super Bowl prep! Brian gets ready for his birthday dinner.
30 January 2019 • 26m and 41s
Alois Mayr shares a few new ideas about Dynatrace and Kubernetes, in addition to looking into what Dynatrace plans to release this year.
30 January 2019 • 11m and 7s
Dynatrace's Adam Dawson gives a few tips on gaining visibility across your entire environment, including infrastructure-only nodes with a single, all-in-one solution. His session showed how to extend Dynatrace to monitor your cloud infrastructure health, in addition to your application performance.
30 January 2019 • 17m and 9s
We chat with Henrik about how awesome Francis is, the new integrations anc capabilities for Neotys and Dynatrace, and the upcoming Neotys Performance Advisory Council in Chamonix, France
29 January 2019 • 28m and 23s
Peter Hack joins us to chat about how Dynatrace helps you to get better visibility into your PaaS environment; from deployment to automation. He shares a few tips from his deep dive session into OpenShift featuring Red Hat.
29 January 2019 • 8m and 51s
We catch up with Chris Morgan of Red Hat here at Dynatrace Perform 2019 and he shares several cool things about latest versions of OpenShift and automated operations and integrations with Dynatrace.
29 January 2019 • 12m and 11s
Guido Deinhammer talks about how to make the most of our latest innovations and enable you to automate and manage your operations at web-scale. He shares insights from his session on management zones, API integrations, deployment automation, and best practices using our open AI.
29 January 2019 • 11m and 15s
Dominik Punz shares his insights on his session about Dynatrace support for mobile platforms and how to intelligently monitor your mobile apps and what you can discover from troubleshooting, to enhancing user experience.
29 January 2019 • 9m and 44s
Daniel Khan talks about how Autodesk transformed customer’s businesses using AWS Lambda by simplifying processes, demonstrating real-world examples about how Dynatrace provides deep insights into Lambda functions and efficiency.
29 January 2019 • 10m and 18s
In this episode Jason Westerhouse shares details on how KeyBank is integrating Dynatrace across their DevOps toolchain and driving automated quality gates, continuous feedback and faster incident response time by embracing GitFlow – using Jenkins, containers and release automation to automate CD.
29 January 2019 • 19m and 18s
In this episode Trevor and Kristof give some practical guidance on how to make the vision of autonomous cloud management a reality, including the changes needed, the steps required to get there and benefits you will reap once you have arrived.
29 January 2019 • 9m and 11s
Florian Ortner reviews some of the top Dynatrace integrations and support for all the major cloud IaaS and PaaS platforms. He shares a few ideas about AWS, Azure, Google Cloud Platform, Pivotal Cloud Foundry, OpenShift and Kubernetes, in addition to looking into what Dynatrace plans to release this year.
29 January 2019 • 11m and 36s
We'll chat about today's announcements on the mainstage
29 January 2019 • 32m and 22s
Live from Dynatrace PERFORM 2019 in Las Vegas, it's Pure Perfbytes Performance Welcome Reception
29 January 2019 • 30m and 4s
If you haven’t heard about the 6-R Migration Patterns, then you probably haven’t heard about the 7-R. The 7th stands for R(e-Fit). In this podcast we chat with Mandus Momberg (@MandusMomberg), Principal Solution Architect at AWS. Mandus is sharing what he has learned from small to large scale application migration & modernization projects. We learn about Modernization Factories, that it is key to have decision maker buy-in and that the most common migration scenario is R(e-Fit). Our biggest takeaway are the 3 key measures of success after a migration: Availability, Elasticity & Agility! Now listen in … https://www.linkedin.com/in/mandusm/ https://aws.amazon.com/migration-acceleration-program/
21 January 2019 • 44m and 43s
Migrating from your data center to the cloud is no easy task. In this episode, Patrick Kemble (@PatrickKemble), CTO at SambaSafety, shares their journey from lift & shift to AWS to re-architecting their applications using Cloud Foundry. Along the way, of course, they discovered many important aspects of monitoring. https://twitter.com/patrickkemble?lang=en https://www.sambasafety.com/
7 January 2019 • 37m and 51s
This episode is a recap of Andi’s presentation at AWS re:Invent where he talked common use cases Operation Teams have been auto-remediate over the years and how now Site Reliability Engineering (SRE) Teams take it to the next level. The key point of Andi’s message is to not only auto-remediate these and newer cloud native use cases in production. It is about shifting-left and preventing them upstream in the delivery pipeline. If you want to learn more check out Andi’s blog or watch the recorded session from re:Invent on YouTube. Also make sure to listen until the end to learn about how you can mail your Christmas wishes to either Santa Claus or the Christkind! Blog: https://www.dynatrace.com/news/blog/shift-left-sre-building-self-healing-into-your-cloud-delivery-pipeline/ Video: https://www.youtube.com/watch?v=PsI4pc0NtoI
17 December 2018 • 43m and 34s
Encore Presentation: Goranka Bjedov ( https://www.linkedin.com/in/goranka-bjedov-5969a6/ ) has an eye over the performance of thousands of servers spread across the data centers of Facebook. Her infrastructure supports applications such as Facebook Social Network, WhatsApp, Instagram and Messenger. We wanted to learn from her how to manage performance in such scale, how Facebook engineers bring new ideas to the market and what role performance and monitoring plays.
3 December 2018 • 48m and 42s
Azure DevOps, formerly known as VSTS, is more than just a set of tools. But what is it exactly? How does it help enterprises to deploy better code faster? Does it only work for Azure or other platforms & clouds as well? How can it be extended or integrated into existing processes and tools? Abel Wang, Sr Cloud Developer Advocate at Microsoft, is giving us a tour through Azure DevOps and how he has seen it implemented and integrated into existing enterprise DevOps tool landscapes. We briefly discussed the Unbreakable Delivery Pipeline for Azure DevOps that Abel helped implement and is now available on the Visual Studio Marketplace. So – give it a try and see for yourself what Abel and team has built! Last but not least we also touched upon Azure DevOps for databases and how you to implement regression testing, continuous deployment and canary releases for database updates. Very intriguing topic that we are sure to cover in future sessions in more detail. https://abelsquidhead.com/index.php/2018/08/03/the-dynatrace-unbreakable-pipeline-in-vsts-and-azure-bam/ https://marketplace.visualstudio.com/items?itemName=Safiahabib.DynatraceUnbreakablePipeline https://twitter.com/AbelSquidHead?lang=en
19 November 2018 • 43m and 26s
Happy Guy Fawkes Night! In the first episode with Ben Rushlo, Vice President of Dynatrace Services, we learned about things like not getting fooled by Bot traffic, which metrics to monitor and how RUM can replace your traditional site analytics. In this episode we dive deeper into RUM use cases around user behavior analytics, bridging the silos between Dev, Ops & Business and elaborate on why blindly optimizing individual page load times is most likely wasted time as you won’t impact what really matters: End-to-End User Experience! In our discussion we also talked about UX vs UI as well as importance of Accessibility. Here two links we want you to look at: Holger Weissboeck on Let’s put U in UX and Stephanie Mcilroy’s presentation at DevOne. Listen to Episode 70: https://www.spreaker.com/user/pureperformance/070-exploring-real-user-monitoring-with- Let's Put the U in UX: https://www.youtube.com/watch?v=qi19hls9LfY Stephanie Mcilroy’s presentation at DevOne: https://devone.us/speakers/#stephaniemcilroy Data vs. Info article Brian mentioned: https://medium.com/@copyconstruct/monitoring-in-the-time-of-cloud-native-c87c7a5bfa3e
5 November 2018 • 58m and 47s
Did you know that Azure Service Fabric is used by most of Microsoft’s global high scale services such as Bing, Dynamics or Xbox)? It’s a battle tested distributed systems platform that enables developers to deploy, manage and scale their microservices. In this session we have Sravan Rengarajan, Program Manager at Microsoft Azure, giving us an overview of the key use cases, how Service Fabric started and in which direction it is heading. We also learn how you get your own free local version of Service Fabric and why Service Fabric gets us towards real Serverless computing. Additional information can be found on the Service Fabric GitHub codebase – yeah – its all out there on GitHub! https://www.linkedin.com/in/sravan-rengarajan/ - Sravan on Linkedin http://aka.ms/servicefabricdocs - learn more about Service Fabric http://aka.ms/servicefabricmesh - learn more about Mesh http://aka.ms/tryservicefabric - free clusters to party on! https://github.com/microsoft/service-fabric - GitHub codebase
22 October 2018 • 47m and 45s
Minecraft – the hugely popular sandbox video game – might not be your traditional software to monitor with an APM (Application Performance Management) tool. But Mike Villiger did it anyway in order to learn some advanced concepts in application monitoring such as custom entry points, thread diagnostics, method hotspots or simply to figure out why his mod’ed Minecraft sometimes couldn’t keep up with processing all the changes and skipped cycles. By using Dynatrace – first AppMon now Dynatrace SaaS – he learned more about the internals of Minecraft, how the single threaded architecture calls each mod and why a single mod must not take longer than 50ms to process. Mike gives us insights into which problems he found within Minecraft but more importantly what he takes away for his daily job as a performance advocate and evangelist. If you have any interesting side projects where you use APM tools let us know – there is always something to learn from every project! https://minecraft.net/ https://twitter.com/mikevilliger
8 October 2018 • 36m and 2s
Brett Hofer is giving us his inside story on how he was called for the rescue to break a monolithic healthcare system that, after 3 years of development, was on the verge of having a major business impact on the largest healthcare vendor in the US. We learn about his strategic decisions such as quieting the system, establish traceability and most importantly: setting up a separate team that broke the monolithic while keeping it in sync with the main branch development. Brett Hofer is now Global Practice Lead at Dynatrace where he and his team help Dynatrace customers successfully walking through their digital transformations. https://www.linkedin.com/in/brett-hofer-2432572/
24 September 2018 • 52m and 43s
Ben Rushlo, Vice President of Dynatrace Services, specializes in the Digital Experience. In this episode, Ben talks to us about Real User Monitoring. What happens when good bots go bad? Can Real User Monitoring (RUM) replace your traditional site analytics? If you have RUM, is there any reason to also use synthetics? What performance metrics are the best when it comes to monitoring the end user? How does RUM help you understand the performance of business? Tune in to episode 70 of PurePerformance for answers to these questions. https://www.linkedin.com/in/benrushlo/
10 September 2018 • 54m and 3s
Serverless has been a hot topic for quite a while, but we are still in the early stages when it comes to best practices and tooling. Justin Donohoo, Co-Founder of observian.com, gives us the pros and cons of 4 architectural patterns that he calls: “Microservice / nano pattern”, “Service Pattern”, “Monolithic Pattern” and the “GraphQL Patterns”. Besides these patterns we also learn about common cost traps and how to “architecture around them”. For more information on serverless Justin also shared his recent Serverless Meetup presentation. And stay tuned – there will be more from Justin around secrets, containers and anything else there is to know about cloud native applications. * https://twitter.com/justindonohoo * https://www.observian.com/ * https://docs.google.com/presentation/d/1ffx9wWioFahxQJbhpPhJ3FC-csNXrdEGdufrj5LwV6w/edit#slide=id.p * https://observian.com/tools/secret-awsome/ * https://www.puresec.io/blog
27 August 2018 • 56m and 2s
In our previous episode with Chris Burrell, Head of Technology at Landbay, we learned how they got rid of end-to-end testing in order to speed up continuous delivery. Today we discuss how they still make sure that no code changes in their microservice architecture breaks end-to-end use cases by leveraging Contract-based Testing using Swagger and several tools in the Swagger ecosystem, e.g: diff, code generation … - also make sure to check out Chris’ presentation at yCon called “CDC is dead – long live swagger”. * https://twitter.com/ChrisBurrell7 * https://swagger.io/ * https://github.com/swagger-api/swagger-codegen/tree/master/modules/swagger-codegen/src/main/resources * http://uk.droidcon.com/skillscasts/11147-lightning-talk-cdc-testing-is-dead-long-live-swagger
13 August 2018 • 32m and 4s
How can you get your build times down to minutes? Exactly: eliminate the largest time consumer! At Landbay, where Chris Burrell heads up technology, this was end-to-end testing. Landbay deploys their 40 different microservices into ECS on a continuous basis. The fastest deployment from code to production is 6 minutes. This includes a lot of testing – but simply not the traditional end-to-end testing any longer. Chris gives us insights in contract testing, mocked frontend testing, how they do Blue/Green deployments and what their strategy is when it comes to rollback or rollforward. For more information watch Chris’s talk on 6 minutes from Code Commit to Live at µCon and his lightening talk on CDC Testing is Dead – Long Live Swagger. https://twitter.com/ChrisBurrell7 https://skillsmatter.com/skillscasts/10714-6-minutes-from-code-commit-to-live-you-won-t-believe-how-we-did-it https://skillsmatter.com/skillscasts/11147-lightning-talk-cdc-testing-is-dead-long-live-swagger#video
30 July 2018 • 41m and 49s
Have you heard about Load Shedding? If not then dive into this discussion with Acacio Cruz, Engineering Director at Google ( https://twitter.com/acaciocruz ). He walks us through what Google learnt from one of the early outages at Gmail and how he and his team are now applying concepts such as load shedding to avoid disruption of their services despite spikes of load or unpredictable requests. We also discuss SRE (Site Reliability Engineering), how it started and transformed at Google and how we should think about automation, configuration of automation, and automation of automation. For more details – including visuals – we encourage you to watch Acacio’s breakout session from devone.at on YouTube (Load Shedding at Google). https://devone.at/speakers/#acaciocruz https://www.youtube.com/watch?v=XNEIkivvaV4
16 July 2018 • 1h, 4m and 15s
Security is on everyone’s mind. One way to strengthen security of your software and increase the awareness of your engineers is running a Security Hackathon – or a “Bug Bounty Program”. We invited Pascal Schulz ( https://www.linkedin.com/in/pascalschulz/ ), Security Engineer at Dynatrace, to the show to give us more background on HACK.DT – a security hackathon he and his team ran earlier this year within the Dynatrace Engineering Labs. For additional details check out his blog Running a successful internal bug bounty program and ping him on twitter (@PascalSec) in case you have further questions. https://www.dynatrace.com/news/blog/running-a-successful-internal-bug-bounty-program/
2 July 2018 • 36m and 46s
We got to chat with Danilo Poccia (@danilop), Global Serverless Evangelist at Amazon Web Services, on how to best leverage serverless and its new principles to speed up bringing new features to the market. We learn about Event Driven Architectures, Continuous Deployment into Production leveraging Canary and Linear Deployments as well as how to automate testing when pushing your serverless code through CI/CD. Also – did you know that you can run all your Lambda tests locally in your machine? Check out AWS SAM ( https://docs.aws.amazon.com/lambda/latest/dg/serverless_app.html ) and SAM Local ( https://docs.aws.amazon.com/lambda/latest/dg/sam-cli-requirements.html ) for more information. Make sure to check out Danilo’s Serverless by Design website ( https://sbd.danilop.net/ ) where it you can visually create your end-to-end serverless architecture and get a CloudFormation template to stand up this environment in your AWS account.
18 June 2018 • 45m and 21s
Donovan Brown, Principal DevOps Manager at Microsoft, is back for a second episode on CI/CD & DevOps. We started our discussion around “The role of Monitoring in Continuous Delivery & DevOps” but soon transferred over to our recent most favorite topic “The Unbreakable Delivery Pipeline”. Listen in and learn more about how monitoring, monitoring as code and automated quality gates can give developers faster and more reliable feedback on the code changes they want to push into production. Also make sure to follow up on Donovan’s road show when he shows Java developers how to build an end-to-end delivery pipeline in 4 minutes. And lets all make sure to remind him about the promise he made during the podcast: Building a Dynatrace Integration into TFS and adopt the “Monitoring as Code” principle
4 June 2018 • 49m and 35s
15 Minutes of your day! That’s all it takes to make the first step towards applying DevOps best practices. To hear more about this and other suggestions on how to jump start your DevOps transformation tune into this episode where we chat with Donovan Brown ( http://donovanbrown.com/ ), Principal DevOps Manager at Microsoft. Did you know that over the last 7 years the VSTS team has increased deployment velocity from once every 3 years to once every 3 weeks? Coordinating 50 different feature team that all commit to master daily? If you always thought that transformation like this are only possible in smaller development organizations then be proven wrong by Donovan, a member of the League of Extraordinary Cloud DevOps Activists. If you want instant advice simply tweet using #LoECDA and summon “the league”!
21 May 2018 • 53m and 42s
Serverless comes with its own set of best practices, quirks and benefits when it comes to monitoring and performance engineering. In this episode we have Michael Garski, Director of Platform Engineering at Fender Musical Instruments ( https://www.linkedin.com/in/mgarski/ ), giving us a technical deep dive into lessons learned and best practices they learned when re-platforming their architecture to AWS Lambda. We get to learn about optimizing Cold Starts, Re-Using HTTP Connections, Leveraging API Gateway Caching and finding the sweet spot for CPU & Memory settings to optimize price/performance of AWS Lambda executions. For more details check out Michael’s slides on Innovating Through React Native Mobile Apps. ( https://www.slideshare.net/AmazonWebServices/innovating-through-react-native-mobile-apps-with-fender-musical-instrumentspdf )
7 May 2018 • 54m and 40s
Josh Long ( https://twitter.com/starbuxman ), Developer Advocate at Pivotal, Java Champion and author of 5 books, gives us a great tour through the latest that is happening in the Spring Universe. If you are new to Spring check out http://start.spring.io/ and create your first project within minutes. When it comes to Reactive make sure to check out https://projectreactor.io/ and dive into https://micrometer.io/ to learn more about how to extract metrics from Spring applications. As Josh is constantly traveling the world chances are high you can meet him at a local conference.
23 April 2018 • 56m and 55s
Visual Replay gives you full film-like replay of your end users, including clicks, mouse moves swipes and scrolls. It helps you optimize user experience by addressing problems where end users struggle, e.g: not finding that button, an overlay dialog hiding critical elements or a 3rd party browser plugin that messes with your page. It also supports compliance use cases such as allowing you to proof what information you really showed to the end user when they conducted online business with you. To learn more about this use cases and the technical implementation details of visual replay technology we invited Simon Scheurer, Chief Software Architect at Dynatrace (https://www.linkedin.com/in/simonscheurer/), to this podcast. He educates us on the latest of this disruptive technology! And besides that we also learn about how awesome Simon’s hometown Barcelona, Spain is.
9 April 2018 • 40m and 6s
You wouldn’t build your own Jenkins – would you? Neither would you build your own CRM, Office or Email service. So why are the “cool” DevOps kids still building their own continuous delivery scripts, log analytics and monitoring and showing it off on GitHub or conferences? In this episode we invited Steve Burton (@BurtonSays), CD geek at harness.io, and discussed the current state of Continuous Delivery and the role of Observability (that’s Monitoring++). We learn about use cases that commercial vendors in these spaces provide out-of-the-box, the APIs they offer to integrate these tools into a larger eco-system and why we believe it’s time to stop building your own tools but investing in building better software for your users. We also learn about Blue/Green deployments, Canary Releases, Continuous Verification and Rollback vs Roll-forward.
26 March 2018 • 43m and 43s
If you still believe Blockchain is just about Bitcoin or that Blockchain is a super safe, high performing platform that simply runs then listen in to this podcast. With David Jones ( https://twitter.com/davidlewisjones ) –AIOps Evangelist – we learn about the different use cases of Blockchain technology, the two top frameworks Ethereum ( https://www.ethereum.org/ ) and Hyperledger ( https://www.hyperledger.org/ ), and also discuss how to monitor both usage and operation of Blockchain to ensure performance for end user applications. As a great read check out his recent blog post on AI-based Monitoring to ensure Blockchain Performance: https://www.dynatrace.com/blog/using-dynatrace-ai-based-monitoring-ensure-blockchain-performance/
12 March 2018 • 38m and 10s
New to Kubernetes? Already a pro? In both cases, tune in to this episode, as we have something for both sides of the aisle. Kubernetes seems to have won the container orchestration game. Major cloud and PaaS vendors are supporting Kubernetes, and attendance at KubeCon in Dec 2017 skyrocketed. Today we chat with Brian Gracely ( https://twitter.com/bgracely ), Director of Strategy at Red Hat. Brian also co-hosts @PodCTL ( https://twitter.com/PodCTL ) – a podcast dedicated to containers, OpenShift, Kubernetes, and Cloud Native. In our chat we learn where and what Kubernetes is right now, where its heading (e.g: providing better onboard experience with developers, more APIs …), why we have to pay attention to Service Mesh ( http://philcalcado.com/2017/08/03/pattern_service_mesh.html ), and why it is important to have a good cross technology monitoring strategy that supports both your brown field legacy services as well as the green field cloud native. We also enlighten you about what the BWI (Brian Wilson Indicator) is!
26 February 2018 • 51m and 25s
James Turnbull ( https://jamesturnbull.net/ ) is an author of 10 books on topics like Docker, Packer, Terraform, Monitoring, … and is currently writing a book on Monitoring with Prometheus https://prometheusbook.com/ . We got to chat about what modern monitoring approaches look like, how to pull in developers to start building monitoring into their systems and how to bridge the gap between monitoring for operations vs monitoring for business. Having a monitoring expert like James that knows many tools in the space was great to validate what we at Dynatrace have been doing to solve modern monitoring problems. We learned a lot about key monitoring capabilities such as capturing data vs capturing information, providing just nice dashboards vs providing answers to known and unknown questions and making monitoring easy accessible so that monitoring can benefit both business, operations and developers. We hope you enjoy the conversation and learn as much as we did. A blog we have been referencing several times during the talk was this one from Cindy Sridharan on Monitoring in the time of Cloud Native: https://medium.com/@copyconstruct/monitoring-in-the-time-of-cloud-native-c87c7a5bfa3e
12 February 2018 • 1h, 5m and 39s
31 January 2018 • 55m and 31s
31 January 2018 • 39m and 41s
Steve Pace, Senior Vice President of Global Sales at Dynatrace, discusses the positioning and offerings of Dynatrace in the AWS Marketplace
31 January 2018 • 6m and 9s
Catch up on what was announced this morning
31 January 2018 • 7m and 56s
We are learning more and more about these deceptively simple-sounding improvements and new features coming for Dynatrace, let's break it down a little more.
31 January 2018 • 24m and 50s
31 January 2018 • 1h, 15m and
Henrik discusses the new Neotys integration with Dynatrace
30 January 2018 • 18m and 14s
We stepped aside for just a few minutes to learn more about RedHat's OpenShift products with Chris Morgan. We chat about their experience in building integration between Dynatrace and OpenShift, excitement about the conference announcements and a shared distrust of mustard-based barbecue sauce in South Carolina.
30 January 2018 • 14m and 26s
Live Lunch with Perf Bytes and PurePerformance
30 January 2018 • 24m and 18s
Live from the conference on day 1, the early annoucements, reflections, excitement, caffeine and live stream from the conference: http://perform.dynatrace.com
30 January 2018 • 51m and 44s
Dave Anderson of Dynartrace
30 January 2018 • 1h, 47m and 55s
Broadcasting Live from Dynatrace Perform 2018 in lovely Las Vegas, Nevada, USA
30 January 2018 • 44m and 24s
Markus Heimbach, Team Lead of the Infrastructure and Service Team at Dynatrace, explains the continuous delivery process of www.dynatrace.com really works behind the scenes. 2 years ago the web site team used a traditional CMS (Content Management System) which was slow, error prone, and didn’t deliver the expected end user experience for visitors of our website. 2 years later Markus and his team built a fully automated “Content Delivery Pipeline”. The team decided to leverage Git, static generated web content, immutable infrastructure, and Dynatrace OneAgent monitoring. Production deployments happen twice a day but staging and development deployments – using the same deployment pipelines – happen much more frequently. The result is a very flexible delivery pipeline, fully version controlled content, a very secure and fast website and everything monitored with Dynatrace. Thanks Markus for letting us look behind the scene of www.dynatrace.com
29 January 2018 • 31m and 55s
Feature Toggles or Feature Flags are not new – but they are a hot topic as they allow safer and fearless continuous delivery. Finn Lorbeer ( https://twitter.com/finnlorbeer ) gives us technical insight into how he has been implementing feature toggles in projects he was involved over the last years. We learn why he loves https://github.com/heartysoft/togglez, how to test feature toggles, monitor the impact of features being toggled and how to make sure you don’t end up in a toggle mess.
15 January 2018 • 30m and 35s
Have heard about “Shifting Left”? Well – get prepared to hear that Shift-Left is not the only solution to building a high quality products. Finn Lorbeer ( http://www.lor.beer/ ) is a Product Quality Specialist working for Thoughtworks. In a recent presentation given at Quest4Quality ( http://questforquality.eu/speakers/finn-lorbeer/ ) in Dublin he explained how being a quality engineer is no longer about being seen as a quality gate (and sometimes bottleneck) in the deliver cycle. Finn is sharing his experience from a recent project with a large German automobile company where he helped transform the development teams to shift-left on quality but not only for the software they develop and test, but also for the type of features they implemented and how they see quality as a whole in their end-to-end delivery cycle. Interesting lessons learned on how to speed up delivery, increase quality and make everyone part of the game!
1 January 2018 • 42m and 46s
Erik Landsness, Director Network Operations Center & SRE at Beachbody, talks us through his last 1.5 years in his role where he has been transforming the role and culture of the traditional NOC team from human-based Dashboard analytics to a Automated Self-Healing Zero-Dashboard Culture. While they haven’t yet reached that end state they have made big strides. Erik shares with us how to gradually transform into a modern operations team that automates things that humans shouldn’t do – such as staring at dashboards on walls 😊 Erik is also presenting at Dynatrace PERFORM 2018. Make sure to check out his session to learn first hand! https://www.dynatrace.com/perform/speakers/
18 December 2017 • 40m and 18s
Are you still deploying machines manually? Do you have to login to machines to apply changes? Do you spend hours or even days to detect infrastructure issues messing with your test execution or even production? We have the answer for your pain: Listen to this podcast! Markus Heimbach leads the Infrastructure and Service team at Dynatrace and explains how they got rid of Snowflakes (not in the political sense), tackled the Configuration Drift issue, and how his team became a Service Organization powering the innovation at Dynatrace R&D. Get a glimpse of his talk track from his presentation at #devone.at - https://speakerdeck.com/markusheimbach/infrastructure-as-code As another teaser: you will hear about Test Automation of Infrastructure Code, leveraging Docker and Kubernetes (k8s) and how to use and leverage Immutable Infrastructure!
4 December 2017 • 52m and 30s
We typically hear about agile transformation being driven from development and eventually pushing it towards operations. But it doesn’t have to be that way as we hear from Nestor and Abeer who helped transform their operations team from Waterfall (Traditional Ops), to Partial Scrum (Intro to Agile) and then Kanban (more defined structures for Ops using Agile principles). Listen in and learn what the differences are between Agile in Dev and Agile in Ops, which metrics they use to measure the success and how they are now pushing towards a DevOps transformation from Ops towards Dev. If you want to chat live with Nestor and Abeer then take the chance and meet them at PERFORM 2018 ( http://perform.dynatrace.com ) where they give us more insights into their transformation
20 November 2017 • 52m and 21s
Most of us remember the DDOS attack last year executed through thousands of Security Camera IoT devices. This raised security questions around IoT but also helped the public to understand that IoT (Internet of Things) is a real thing. In this session, we learn from Harald Zeitlhofer ( https://twitter.com/HZeitlhofer ) why he rather likes to call this hot trend IoE (Internet of Everything), what the key use cases of IoE are and how proper monitoring of these devices might have been the key to detect the attack before it actually happened. To learn more about this exciting next big thing we suggest to start with Harald’s latest blog posts on his most favorite topic.
6 November 2017 • 41m and 40s
If you believe OpenStack and OpenShift are pretty much the same thing. you better listen to this episode with Martin Etmajer ( https://twitter.com/metmajer ). He explains what OpenShift is, how it differentiates from Cloud Foundry and other PaaS platforms, and which major contribution it can have to successful DevOps transformations. To put it in his words: OpenShift provides great user experience for developers to push their code changes automatically, packaged as containers, into different environments without having to worry about where and how these containers run or how they scale up & down. You should also check out his presentations from Red Hat Summit on Monitoring and Logging in OpenShift ( https://www.slideshare.net/martinetmajer ) as well as more material on http://www.dynatrace.com/openshift.
23 October 2017 • 49m and 21s
Project Jigsaw, G1 as default garbage collector, ahead-of-time compilation, Stack Walking API and many more changes that you should be aware of when upgrading to Java 9. Philipp Lengauer, whom we met at devone.at, gives us all the answers and technical deep dive into all these JVM changes. Especially for performance engineers an episode worth while listening to. If you want to learn more check out Philipps presentation at devone: https://youtu.be/Nsg_rhlf4_U?list=PLfi6VUNSzNYmUyeZ2BTM_WmZjgi0FOqRl
9 October 2017 • 50m and 50s
If you thought EC2 was the first service offered by Amazon Web Services and if you think 53 in “Route 53” is just a random number then you should listen to this 101 on AWS Podcast. This time we got to chat with Wayne Segar ( https://www.linkedin.com/in/wayne-segar-6222ba57/ ) who has been helping companies to move to new cloud technologies and services such as AWS. Wayne gave us a great overview of the key services in Compute, Database, Storage, Management, Development as well as how Monitoring works with AWS. If you want to make your first steps with AWS, such as deploying your first EC2 Instance or Application on Elastic Beanstalk, then feel free to follow our 101 AWS Monitoring Tutorial: https://github.com/Dynatrace/AWSMonitoringTutorials
25 September 2017 • 58m and 58s
Why would I move to .NET Core? If I move, can I just recompile my .NET code with the new .NET Core and run it on Linux? Or is there more to it? What is .NET Core at all and what does it provide as compared to ASP.NET Core? Can I still monitor my .NET Applications the same way as in the past or is there a new approach for tracing and monitoring? And is it true that all of this is now available on GitHub as Open Source project? Get answers to all these questions by listening to this episode where we got to talk with Christoph Neumueller @discostu105 ( https://twitter.com/discostu105 ) and Gergely Kalapos @gregkalapos ( https://twitter.com/gregkalapos ). Christoph and Gergely are two lead engineers for the Dynatrace .NET Agent technology. They are also code contributors to the .NET Open Source and other open source projects such as SuperDump ( https://github.com/Dynatrace/superdump ). Also check out their blogs ( https://www.dynatrace.com/blog/tag/net/ ) to learn more on .NET Core, ASP.NET Core and other .NET relevant performance topics. And, if you're a Dynatrace customer, make sure to up-vote the RFE to allow hot sensor placement for .net core. ( https://answers.dynatrace.com/spaces/151/product-feedback-and-enhancement-requests/idea/183988/hot-sensor-placement-for-net.html ) Additional Links: TechEmpower Benchmark ( https://www.techempower.com/benchmarks/ ) Blog about performance improvements as a result of community input ( https://blogs.msdn.microsoft.com/dotnet/2017/06/07/performance-improvements-in-net-core/ ) Download and try out .Net Core ( http://dot.net ) Dynatrace Free Trial ( https://www.dynatrace.com/trial/?vehicle_name=www.spreaker.com )
11 September 2017 • 51m and 33s
Visually Complete and Speed Index have been introduced to better measure real end user performance experience. Klaus Enzenhofer @kenzenhofer ( https://twitter.com/kenzenhofer ) gives us a detailed description of these metrics, how they are getting calculated, and which problem they solve. What we also learn in this 101 is why now we finally have these metrics available not just for synthetic monitoring but also for real user monitoring. This can be attributed to the advances in browser technologies as well as to some smart engineering. In our discussion we also cover other recent advances and use cases in Web Performance Optimization – such as the usage of performance markers. If you want to learn more check out the blogs from Google on Speed Index ( https://sites.google.com/a/webpagetest.org/docs/using-webpagetest/metrics/speed-index ) as well as the blog from Klaus on how Speed Index and Visually Complete made it into RUM offerings ( https://www.dynatrace.com/blog/visually-complete-speed-index-for-real-user-monitoring-rum/ ).
28 August 2017 • 29m and 31s
Spoiler Alert: Serverless doesn’t mean that we got rid of servers. We just don’t have to think about them anymore as we can focus on coding functions that get executed when triggered through certain events. Daniel Khan (@dkhan) tells us more about use cases of Serverless or as he likes to call it “Function as a Service” (FaaS). We also chat a lot about monitoring and the challenges of actually monitoring and debugging serverless code. It is still a young technology but constantly evolving.
14 August 2017 • 26m and 51s
It sounds like 3 buzzwords, But there is more than that. We were intrigued by the Digital Mastery & Joy ( https://info.dynatrace.com/apm_wc_panera_na_registration.html ) webinar Klaus Enzenhofer @kenzenhofer ( https://twitter.com/kenzenhofer ) did with Panera Bread. In his introductory statement, Klaus cited a recent study from IDG on Digital Customer Experience. The biggest challenges are data silos, poor data quality, redundant data, and missing coordination between departments that manage the individual digital touchpoint channels (Mobile, IoT, Web, Physical, …). In our discussion we find lots of parallels between the problem that DevOps tries to solve and which challenges digital transforming businesses face: Silos! Disconnected Silos! But instead of Silos between Dev & Ops its Silos between your Business Teams that are all strictly focusing on their slice of bread (to reference some great stories from Prashant Karre, Director of Performance Engineering at Panera) Listen in and join our conversation. Make sure to check out the webinar recording Digital Mastery & Joy ( https://info.dynatrace.com/apm_wc_panera_na_registration.html )
31 July 2017 • 21m and 54s
What is Cloud Foundry? And why does Alois Mayr (@mayralois) say that Cloud Foundry is the most opinionated PaaS Platform in the world? Listen to this 101 show to get a good overview of what Cloud Foundry (CF) is, how it started and what offerings are available right now. Also learn what the main use cases are for CF Cloud Operators as well as for Developers that use the platform to push their applications and services. If you want to learn more or see Alois in action make sure to watch his Full Stack Monitoring on Cloud Foundry PurePerformance Clinic. ( https://www.youtube.com/watch?v=vsQHtdeizXQ&index=65&list=PLqt2rd0eew1bmDn54E2_M2uvbhm_WxY_6&t=1237s )
17 July 2017 • 50m and 26s
If you wonder what the top 3 ways are to pronounce Azure, then check out this episode. Also, if you want to learn more about what Azure really provides, why it used to be ahead of the curve, and why Microsoft had to re-invent it to provide services that software companies really needed, you won't want to miss this episode. Martin Gutenbrunner (@MartinGoodwell) gives us a good overview of the key Azure services and use cases that make Azure an interesting platform for many enterprises. It might also be surprising that it is not just a Microsoft-lock in Technology Stack. Besides .NET, there are many technologies that companies can use to run their applications on Azure IaaS, but more so on Azure Service Fabric. Listen in and learn the core fundamentals of Azure, why it might be interesting for you and what role monitoring plays.
3 July 2017 • 55m and 13s
If you think Node.js is just a technology used by small start ups then you better listen to this 101 episode. Daniel Khan (@dkhan) – a member of the Node.js community and working group – answers a lot of questions on why large enterprises such as Walmart, Paypal or Intuit use Node.js to innovate. Daniel also explains the internals of Node.js, its event driven processing model, its non-blocking asynchronous nature, and how that enables a list of interesting use cases. We also discuss how to monitor and optimize applications running on Node.js and why that might be different for a developer as compared to an Ops team that runs Node.js in combination with other enterprise software.
19 June 2017 • 47m and 38s
What is OpenStack? Oh – it's not the same as OpenShift? So what is OpenStack? If these questions are on your mind and you want to learn more about why OpenStack is used by many large organizations to build their own private cloud offering than listen to this 101 talk with Dirk Wallerstorfer (@wall_dirk). We learn about the different OpenStack core controller services (Cinder, Horizon, Keystone, Neutron, Nova …) as well as the core cloud services (Compute, Storage, Network, …) it provides to its users. Dirk also explains why and who is moving to OpenStack and what the challenges and use cases are when it comes to monitoring OpenStack environments – both for an OpenStack Operator as well as for the Application Owners that run their apps on OpenStack.
5 June 2017 • 35m and 59s
Todd DeCapua has been a performance evangelist for many years. In his recent work and publications, which includes Effective Performance Engineering ( http://www.effectiveperformanceengineering.com/ ) as well as several publications on outlets such as TechBeacon ( https://techbeacon.com/contributors/todd-decapua ), he introduces DevOps best practices to improve the 5 S-Dimensions: Speed, Stability, Scalability, Security and Savings. In our discussion with Todd we focused a lot on Security as it has been a more prominent topic in our industry recently. How to bake Security into the delivery pipeline and why it is such an important aspect. Automation seems to be the key which also includes automating functional checks, performance checks and – as we said: Security! Related Links: Follow Todd on Twitter ( https://twitter.com/AppPerfEng ) Follow Todd on LinkedIn ( http://www.linkedin.com/in/todddecapua ) Blog: How to build performance into your user stories ( https://techbeacon.com/how-build-performance-your-user-stories )
22 May 2017 • 53m and 50s
For our one year anniversary episode, we go “back to basics”, or, better said, “back problem patterns”. We picked three patterns that have come up frequently in recent “Share Your PurePath” sessions from our global user base and try to give some advice on how to identify, analyze and mitigate them: · Bad Multi-threading: Multi-threading is not a bad thing – but if done wrong it doesn’t allow your application to scale. We discuss key server metrics and how to correctly read multi-threaded asynchronous PurePaths. Also see the following blog: https://www.dynatrace.com/blog/how-to-analyze-problems-in-multi-threaded-applications/ · When Micro Service become Nano Services. This was inspired by a blog from Steven Ledoux ( https://www.dynatrace.com/blog/micro-services-when-micro-becomes-nano/ ). It's important to keep a constant eye on your micro-service architecture to avoid too tightly coupled or too fine grained architectures · Garbage Collection Impact: GC is important but bad memory management and heavy GC can potentially impact your critical transactions. We discuss different approaches on how to correctly measure the impact of garbage collection suspension. If you want to learn more check out the Java Memory Management secton of our online performance book: https://www.dynatrace.com/resources/ebooks/javabook/impact-of-garbage-collection-on-performance/
8 May 2017 • 47m and 31s
In this second episode with Goranka Bjedov from Facebook, we learn details about how Facebook monitors their infrastructure, services, applications and end users. Why they built certain tooling, and how & who analyzes that data. We then shifted gears to development where we learned how the onboarding process of developers works and that Goranka herself made her first production deployment within the first week of employment. Join us and learn a lot about the culture that drives Facebook Engineering
24 April 2017 • 36m and 52s
Goranka Bjedov ( https://www.linkedin.com/in/goranka-bjedov-5969a6/ ) has an eye over the performance of thousands of servers spread across the data centers of Facebook. Her infrastructure supports applications such as Facebook Social Network, WhatsApp, Instagram and Messenger. We wanted to learn from her how to manage performance in such scale, how Facebook engineers bring new ideas to the market and what role performance and monitoring plays.
24 April 2017 • 47m and 43s
In the second episode with Rick Boyd (check out his GitHub repo - https://github.com/DJRickyB ) we talk about how performance engineering evolved over time – especially in an agile and DevOps setting. It’s about how to evolve your traditional performance testing towards injecting performance engineering into your organizational DNA, providing performance engineering as a service. Making it easy accessible to developers whenever they need performance feedback. Rick gives us insights on how he is currently transforming performance engineering at IBM Watson. We also gave a couple of shout outs to Mark Tomlinson and his take on Performance in a DevOps world!
10 April 2017 • 44m and 7s
We got Rick Boyd ( https://www.linkedin.com/in/richardjboyd/ ) – Application Performance Engineer at IBM Watson – and elaborated what Continuous Performance Testing is all about. We all concluded its about Faster Feedback in the development cycle back to the developers – integrated into your delivery pipeline. As compared to delivering performance feedback only at the end of a release cycle. We discussed different approaches on how to “shift left performance” with the benefit of continuous performance feedback!
27 March 2017 • 43m and 34s
Brett Hofer (@brett_solarch) has been engaged in numerous DevOps Transformation projects mainly for very large enterprises. We got to talk with him on this episode to learn more about how he assesses the status quo when he walks into an organization, what the top blocking items for a successful transformation are and what the best approaches are to implement the recommended changes. Spoiler alert: we talked a lot about IT Ops Automation, building cross functional teams and understanding and defining responsibilities and roles. If you want to learn more about what Brett is doing check out his blogs about DevOps on https://www.dynatrace.com/blog/author/brett-hofer/.
13 March 2017 • 40m and 15s
Thomas McGonagle just had his 10 years DevOps anniversary at it was 10 years ago when he got first exposed to Infrastructure as Code through Puppet. He is currently working with F5, helping Big IP Network Teams around the world automate the Network as part of their DevOps transformation. We met Tom at a recent DevOps meetup in Boston which sparked this conversation on what “Metrics Driven Continuous Delivery” could mean for Network Operations Engineers. What are the metrics to look at? How to engage with the application teams to provision better and automated network resources? How to bake this into the Continues Delivery Cycle? Besides NetOps Thomas is also passionate about CI/CD. He runs the largest Jenkins User Group in the World out of Boston, MA. If you happen to be around check out their next meetups and DoJo’s: https://www.meetup.com/Boston-Jenkins-Area-Meetup/
27 February 2017 • 28m and 41s
Mike Horwitz (https://www.linkedin.com/in/mike-horwitz-a40a139 ) has been working with Mainframe since the mid 80s. In this podcast he explains basic terminology and the challenges that come with the interaction to the distributed and cloud native world. Monitoring end-to-end is a critical capability especially when it comes to cost savings and including the mainframe components in a CI/CD/DevOps environment. If you want to learn more about common mainframe performance and monitoring challenges check out our YouTube Performance Clinic: https://www.youtube.com/watch?v=8eodOw3gnMA&list=PLqt2rd0eew1bmDn54E2_M2uvbhm_WxY_6&index=55
13 February 2017 • 43m and 50s
INFO COMMENTS We take time out chat with Jason Suss and Dynatrace RUM on RUM, we search for Richard Bentley at the nightclub, Vikram survived the performance puzzlers, Rick Boyd from IBM Watson and Stefan Baumgartner tells us all about his work at Dynatrace and podcasting at http://workingdraft.de
9 February 2017 • 52m and 4s
Josh McKenty of Pivotal
9 February 2017 • 44m and 39s
Brian Wilson takes time from his feverish disco dancing to have several interviews with attendees at the Tuesday evening party at Dynatrace 2017 in Las Vegas.
8 February 2017 • 29m and 56s
Good morning Las Vegas! We chat today about how to apply logic to Dynatrace Davis, we hear an interesting performance story from Lianggui and an industry update from Vice President of Consulting at CGI, Walter Kuketz.
8 February 2017 • 50m and 56s
Wrapping up the day with interviews from Henrik Rexed from Neotys and our new friends Rajesh Jain and Maggie Ambrose from Pivotal Labs.
8 February 2017 • 46m and 22s
We continue coverage of Dynatrace Perform 2017 with interviews from John Delfeld of Ixia, our favorite performance geek Andreas Grabner from Dynatrace and strategic partner Ryan Faulk of Faulk Consulting.
7 February 2017 • 46m and 15s
7 February 2017 • 1h, 6m and 38s
Woo hoo!! We're kicking off the Dynatrace conference in Las Vegas at the Cosmopolitan Hotel!
7 February 2017 • 1h, 4m and 9s
Eric Wright (@discoposse) is a “veteran” and expert when it comes to virtualization and cloud technologies. He introduces us into the field of container and container orchestrations, the vendors in the space, the pros and cons and the key capabilities he things have to be considered when evaluating the next generation virtualization platform for your enterprise. If you want to learn more check out his podcast - http://gcondemand.podbean.com/ - as well as his publications on https://turbonomic.com/author/eric-wright/
30 January 2017 • 49m and 55s
How often have you deployed an application that was supposed to be load tested well but then crashed in production? One of the reasons might be that you never took the time to really analyze real life load patterns and distributions. Brian Chandler (@Channer531) (https://www.linkedin.com/in/brian-chandler-8366663b ) – Performance Engineer at Raymond James – has worked with their Operations Team to not only start loving application specific performance data captured in production. They starting breaking down the DevOps Walls from Right to Left by sharing this data with Testers to create more realistic load tests but also started education developers to learn from real life production issues. We hope you enjoy this one as we learn a lot of cool techniques, metrics and dashboards that Brian uses at Raymond James. If you want to see it live check out our webinar where he presented their approach as well: https://info.dynatrace.com/apm_wc_getting_started_with_devops_na_registration.html You can view the screenshots we refer to at: https://assets.dynatrace.com/en/images/general/Chandler_01.jpg https://assets.dynatrace.com/en/images/general/Chandler_02.jpg
17 January 2017 • 50m and 16s
HAPPY NEW YEAR Daniel Freij (@DanielFreij) – Senior Performance Engineer and Community Manager at Apica – has been doing hundreds of load tests in his career. 5-10 years ago performance engineers used the “well known” load testing tools such as Load Runner. But things have changed as we have seen both a Shift-Left and a Shift-Right of performance engineering away from the classical performance and load testing teams. Tools became easier, automatable and cloud ready. In this session we discuss these changes that happened in the recent years, what it means for today’s engineering teams and also what might happen in 5-10 years from now. We also want to do a shout out to a performance clinic Daniel and Andi are doing on January 25th 2017 where they walk you through a modern cloud based pipeline using AWS CodePipeline, Jenkins, Apica and Dynatrace. Registration link can be found here: http://bit.ly/onlineperfclinic Related Link: ZebraTester Community: https://community.zebratester.com/
2 January 2017 • 50m and 36s
Mark Tomlinson, still a veteran and performance god, is enlightening us on his concept of Continuous Acceleration of Performance. Continuous Delivery is all about getting faster feedback from code changes as code gets deployed faster in smaller increments to the end user. One aspect that is often left out is feedback on performance metrics and behavior. In the “old days” performance feedback was given very late – either in the load testing phase at the end of the project lifecycle or even as late as when it hits production. That could be too late and it makes it hard to fix the root cause. Listen to our conversation on how to accelerate performance related feedback loops without getting overwhelmed with too much data!
19 December 2016 • 34m and 31s
Mark Tomlinson, “a veteran” in Performance Engineering, discusses how DevOps is a big opportunity for performance engineering – but also a threat for many that have been in the business for a long time. The big question is: are “traditional performance engineers” using their Load Runners or SilkPerformers at the end of the project lifecycle ready to change? Ready to learn new tools? Ready to think about automating performance engineering into the delivery pipeline and doing that in collaboration with the rest of the engineering team? Ready to “Check your Ego at the door”? Listen to our conversation where we also discuss how these roles have changed in organizations we recently interacted with.
5 December 2016 • 32m and 57s
In Part II with Finn Lorbeer (@finnlorbeer) from Thoughtworks we discuss some of the new approaches when implementing new software features. How can we build the right thing the right way for our end users? Feature development should start with UX wireframes to get feedback from end users before writing a single line of code. Feature teams then need to define and implement feedback loops to understand how features operate and are used in production. We also discuss the power of A/B testing and canary releases as it allows teams to “experiment” on new ideas and thanks to close feedback loops will quickly learn on how end users are accepting it. *****Related Links:****** Process Automation and Continuous Delivery at OTTO.de https://dev.otto.de/2015/11/24/process-automation-and-continuous-delivery-at-otto-de/ Are we only Test Manager? http://www.lor.beer/are-we-only-test-manager/ Sind wir wirklich nur Testmanagerinnen? https://dev.otto.de/2016/06/08/sind-wir-wirklich-nur-testmanagerinnen/
21 November 2016 • 36m and 59s
Finn Lorbeer (@finnlorbeer) is a quality enthusiast working for Thoughtworks Germany. I met Finn earlier this year at the German Testing Days where he presented the transformation story at Otto.de. He helped transform one of their 14 “line of business” teams by changing the way QA was seen by the organization. Instead of a WALL between Dev and Ops the teams started to work as a real DevOps team. Further architectural and organizational changes ultimately allowed them to increase deployment speed from 2-3 per week to up to 200 per week for the best performing teams. *****Related Links:****** Process Automation and Continuous Delivery at OTTO.de https://dev.otto.de/2015/11/24/process-automation-and-continuous-delivery-at-otto-de/ Are we only Test Manager? http://www.lor.beer/are-we-only-test-manager/ Sind wir wirklich nur Testmanagerinnen? https://dev.otto.de/2016/06/08/sind-wir-wirklich-nur-testmanagerinnen/ Das Leben ist hasselhoff http://giphy.com/search/david-hasselhoff
21 November 2016 • 40m and 57s
7 November 2016 • 20m and 20s
7 November 2016 • 26m and 44s
7 November 2016 • 34m and 16s
Guest Star: Anita Engleder - DevOps Manager at Dynatrace In this second part of our podcast Anita gives us more insights into how new features actually get developed, how they measure their success and how to ensure that the pipeline keeps up with the ever increasing number of builds pushed through it. We will learn more about the day-to-day life at Dynatrace engineering but especially about the “Lifecycle of a Feature, its feedback loop and what the stakeholders are doing to make it a success” Related Links: Dynatrace UFO https://github.com/Dynatrace/ufo
25 October 2016 • 47m and 34s
Guest Star: Anita Engleder - DevOps Manager at Dynatrace As a follow up to our podcast with Bernd Greifender, CTO and Found of Dynatrace, who talked about his 2012 mission statement to the engineering team: “We go from 6 months 2 weeks release cycles” we now have Anita Engleder, DevOps Lead at Dynatrace on the mic. Anita has been part of that transformation team and in the first episode talks about what happened from 2012 until 2016 where the engineering team is now deploying a feature release every other week, makes 170 production deployment changes per day and can push a code change into production within an hour if necessary. She will give us insights in the processes, the tools but more importantly about the change that happened with the organization, the people and the culture. She will also tell us what she and her “DevOps” team actually contribute to the rest of the organization. Are they just another new silo? Or are they an enabler for engineering to push code faster through their pipeline?
25 October 2016 • 34m and
We got to talk with Bernd Greifeneder, Founder and CTO of Dynatrace, who recently gave a talk on “From 0 to NoOps in 80 Days” explaining the “Digital Transformation Story of Dynatrace – the product as well as the company” The transformation started in 2012 when Dynatrace used to deploy 2 major releases of its Dynatrace AppMon & UEM product to the market. The incubation of the startup Ruxit within Dynatrace allowed engineering, marketing and sales to come up with new ways and ideas that allow continuous innovating. In 2016 the incubated team was brought back to Dynatrace to accelerate the “Go To Market” of all the innovations. A new version of its Dynatrace SaaS and Managed offering is now released every 2 weeks with 170 production updates per day. Many aspects were also applied to all other product lines and engineering teams which boosted the output and raised quality of these enterprise products.
10 October 2016 • 52m and 17s
Are there new Web Performance Rules since Steve Souders started the WPO movement about 10 years ago? Do we still optimize on round trips or does HTTP/2 change the game? How do we deal with “mobile only” users we find in emerging geographies. How does Google itself optimize its search pages and what can we learn from it. In this session we really got to cover a lot of the presentation Pat Meenan (@patmeenan) did at Velocity this year. Related Links: * Scaling frontend performance - Velocity 2016 ***** https://www.youtube.com/watch?v=LdebARb8UJk * WEBPAGETEST ***** https://www.webpagetest.org * Google AMP ***** https://www.ampproject.org/ ***** https://github.com/ampproject/amphtml
26 September 2016 • 41m and 24s
Pat Meenan (@patmeenan) is a veteran when it comes to Web Performance Optimization. Besides being the creator of WebPageTest.org he has also done a lot of work recently on the Google Chrome team to make the browser better and faster. During his recent Velocity presentation on “Using Machine Learning to determine drivers for bounce and conversion” he presented some very controversial findings about what really impacts end user happiness. That it was not rendering time but rather DOM Load Time that correlates with conversion and bounce rates. In this session we dig a bit deeper into which metrics you can capture from your website and presented them to your business side as an argument for investing in faster websites. Find out which metric you really need to optimize in order to “move the needle” Related Links: * Using machine learning to determine drivers of bounce and conversion - Velocity 2016 ***** https://www.youtube.com/watch?v=TOsqP16jnDs * WEBPAGETEST ***** https://www.webpagetest.org/ * WPO-Foundation Github repository for machine learning ***** https://github.com/WPO-Foundation/beacon-ml
26 September 2016 • 33m and 42s
Adam Auerbach (@Bugman31) has helped Capital One transform their development and testing practices into the Digital Delivery Age. Practicing ATDD and DevOps allows them to deploy high quality software continuously. One of their challenges has been the rather slow performance testing stage in their pipeline. Breaking up performance test into smaller units, using Docker to allow development to run concurrency and scalability tests early on, and automating these tests into their pipeline are some of the actions they have taken to level-up their performance engineering practices. Listen to this podcast to learn about how Capital One pushes code through the pipeline, what they have already achieved in their transformation and where the road is heading. Related Links: * Hygea Delivery Pipeline Dashboard https://github.com/capitalone/Hygieia * Capital One Labs http://www.capitalonelabs.com/#welcome * Capital One DevExchange https://developer.capitalone.com/
12 September 2016 • 51m and 37s
Do you speak SQL? Do you know what an Execution Plan is? Are you aware that large amounts of unique queries will impact Database Server CPU and also efficiency of the Execution Plan and Data Cache? These are all learnings from this episode where Sonja Chevre (@SonjaChevre) and Harald Zeitlhofer (@HZeitlhofer) – both database experts at Dynatrace – pointed out database performance hotspots and optimizations that you many of us probably never heard about. Watch the Online Performance Clinic -Database Diagnostics Use Cases with Dynatrace https://www.youtube.com/watch?v=pEXfqzE-WQM
29 August 2016 • 44m and 15s
Are you still exporting load testing reports into Excel compare different runs manually? Matt Eisengruber – Guardian at Dynatrace – walks us through the life-changing transformation story of one of his former clients who used to spend an entire business day analyzing LoadRunner results. Through automation, they managed to get her the results when she walks into the office in the morning – giving her more time to do “real” business analyst work instead of doing manual number crunching. Matt shares some insights into what exactly it is they did to automate Dynatrace Load Test comparison, how they created the reports and which metrics they ended up looking at.
15 August 2016 • 50m and 9s
Alois Reitbauer (@AloisReitbauer) guest hosts - Mike Jones ( http://bit.ly/mjlnk ) takes us on a journey how the team moved a monolithic application that was built by a remote team to a micro service architecture. Learn how the manage a couple of million lines of code with only 5 people while improving performance and availability. Mike also shares lessons learned on their journey and shares strategies on how to make the transition to micro services while having to keep the lights on for day-to-day business.
8 August 2016 • 36m and 11s
Scott Stocker (@sestocker), Solution Architect at Perficient, tells us the background of a recent load testing engagement on an ASP.NET App running on SiteCore. Turns out that even these apps on the popular Microsoft platform suffer from the same architectural and implementation patterns as we see everywhere else. Bypassing the caching layer through FastQuery resulted in excessive SQL, which caused the system to not scale, but crumble. Scott tells us how they identified this issue and what his approach as an architect is to proactively identify most common performance and scalability problems.
1 August 2016 • 55m and 25s
The initial idea of the Cloud has long become commodity – which is IaaS. Containers are the current hype but still require you to take care of correctly configuring your container that will run your code. Mike Villiger (@mikevilliger) – a veteran and active member of the cloud community – explains why it is really PaaS that should be on top of your list. And why monitoring performance, architecture and resource consumption is more important than ever in order for your PaaS Adventure not to fail. Related article: http://www.it20.info/2016/03/the-incestuous-relations-among-containers-orchestration-tools/
18 July 2016 • 56m and 47s
In Part II, Richard Dominguez, Developer in Operations at PrepSportswear, is explaining the significance of understanding and dealing with bot and spider traffic on their eCommerce site. He explains why they route search bot traffic to dedicated servers, how to better serve good bots and how to block the bad ones. Most importantly: we learn about a lot of metrics he is providing for the DevOps but also the marketing teams to run a better online experience!
1 July 2016 • 39m and 1s
Have you ever wondered how to argue with a marketeer about not releasing a new feature or running this campaign? Or in the contrary: how can you show a marketeer that performance engineering and monitoring is as critical to the success of a campaign as the marketing campaign itself? Richard Dominguez, Developer in Operations at PrepSportswear, is enlightening us about how his DevOps team is cooperating with marketing to have a better shared understanding between business and technical goals!
1 July 2016 • 50m and 56s
Microsoft is doing a good job in shielding the complexity of what is going on in the CLR from us. Until now Microsoft is taking care to optimize the Garbage Collector and tries to come up with good defaults when it comes to thread and connection pool sizes. The problem though is that even the best optimizations from Microsoft are not good enough if your application suffers from poor architectural decisions or simply bad coding. Listen to this podcast to learn about the top problems you may suffer in your .NET Application. We have many examples and we discuss how you can do a quick sanity check on your own code to detect bad database access patterns, memory leaks, thread contentions or simply bad code that results in high CPU, synchronization or even crashes!
20 June 2016 • 56m and 39s
The Java Runtime has become so fast that it shouldn’t be the first one to blame when looking at performance problems. We agree: the runtime is great, JIT and garbage collection are amazing. But bad code on a fast runtime is still bad code. And it is not only your code but the 80-90% of code that you do not control such as Hibernate, Spring, App-Server specific implementations or the Java Core Libraries. Listen to this podcast to learn about the top Java Performance Problems we have seen in the last months. Learn how to detect bad database access patterns, memory leaks, thread contentions and – well – simply bad code resulting in high CPU utilization, synchronization issues or even crashes!
6 June 2016 • 57m and 30s
How can you performance test an application when you get a new build with every code check-in? Is performance testing as we know it still relevant in a DevOps world or do we just monitor performance in production and fix things as we see problems come up? Continuous Delivery talks about breaking an application into smaller components that can be tested in isolation but also deployed independently. Performance Testing is more relevant than ever in a world where we deploy more frequently – however – the approach of executing these tests has to change. Instead of executing hourly long performance test on the whole application we also need to break down these tests into smaller units. These tests need to be executed automatically with every build – providing fast feedback on whether a code change is potentially jeopardizing performance and scalability Listen to this podcast to get some new insights and ideas on how to integrate your performance tests into your Continuous Delivery Process. We discuss tips&tricks we have seen from engineering teams that made the transition to a more “agile/devopsy” way to execute tests
24 May 2016 • 49m and 15s
Have you ever wondered what other people mean when they talk about a performance or load or stress test? What about a penetration test? There are many definitions floating around and things sometimes get confused. Listen to this podcast and let us clarify for you by giving you our opinion on the different types of tests that are necessary when testing an application. In the end you can make up your own mind what best term to use. Special Guest: Mark Tomlinson (@mtomlins) of PerfBytes
5 May 2016 • 57m and 5s
If you are running load tests it is not enough to just look at response time and throughput. As a performance engineer you also have to look at key components that impact performance: CPU, Memory, Network and Disk Utilization should be obvious. Connection Pools (Database, Web Service), Thread Pools and Message Queues have to be part of monitoring as well. On top of that you want to understand how your individual components that you test (frontend server, backend services, database, middleware, …) communicate with each other. You need to identify any communication bottlenecks because of too chatty components (how many calls between tiers) and to heavy weight conversations (bandwidth requirements). Listen to this podcast and learn which metrics you should look at while running your load test. As performance engineer you should not only report that the app is slow under a certain load but also give recommendations on which components are to blame.
5 May 2016 • 49m and 29s