What Is Resilience Engineering?


But modern engineering practices have moved beyond this fear, giving birth to a new practice in DevOps and site reliability engineering (SRE) known as resilience engineering. Resilience engineering today isn’t thought of as a function.

Data engineers vs. data scientists

O'Reilly Media - Data

It’s important to understand the differences between a data engineer and a data scientist. I think some of these misconceptions come from the diagrams that are used to describe data scientists and data engineers. Overly simplistic Venn diagram with data scientists and data engineers.

Software Developer vs Software Engineer — Differences: Bogus or Real?

Dzone - DevOps

From the desk of a brilliant weirdo #1: “The difference between a software developer and a software engineer is the same as the difference between big and large.” - Unknown.

Why is Site Reliability Engineering Important?


Site reliability engineering (SRE) is one of the fastest-growing enterprise roles and set of operational practices for managing services at scale. During the Site Reliability Engineering SKILup Day on June 18, I asked a group of the DevOps Institute Ambassadors why SRE was important to them.

Why Distributed Tracing is Essential for Performance and Reliability

Speaker: Daniel "spoons" Spoonhower, CTO and Co-Founder at Lightstep

Many engineering organizations have now adopted microservices or other loosely coupled architectures, often alongside DevOps practices. Together these have enabled individual service teams to become more independent and, as a result, have boosted developer velocity. However, this increased velocity often comes at the cost of overall application performance or reliability. Worse, teams often don’t understand what’s affecting performance or reliability – or even who to ask to learn more. Distributed tracing was developed at organizations like Google and Twitter to address these problems and has also come a long way in the decade since then. By the end of this presentation, you’ll understand why distributed tracing is necessary and how it can bring performance and reliability back under control.

Recommendation Engines 101


Recommendation engines are everywhere today, whether explicitly offered to users (e.g., […]). And while building a simple recommendation engine can be quite straightforward, the real challenge is to actually build one that works and where the business sees real uplift and value from its output.

The Spark Engine

Agile Alliance

This book describes creative exercises that foster team-building and warm up the right side of your brain – the creative side.

Site Reliability Engineering 101: DevOps Versus SRE


Consider the following scenario. An independent software vendor (ISV) developed a financial application for a global investment firm that serves global conglomerates, leading central banks, asset managers, brokerage firms and governmental bodies.

Designing Engineering Teams for Scale


As a senior engineering manager who has taken many companies from just a few engineers to entire teams at global scale in hypergrowth, I have learned quite a few lessons along the way on coping with growth and scale on a diversity of fronts.

Data engineering: A quick and simple definition

O'Reilly Media - Data

Get a basic overview of data engineering and then go deeper with recommended resources. As the data space has matured, data engineering has emerged as a separate and related role that works in concert with data scientists.

Nine Developer Enablement Practices to Achieve DevOps at Enterprise Scale

In this eBook, Christian Oestreich, a senior software engineering leader with experience at multiple Fortune 500 companies, shares how a metrics-driven mindset can dramatically improve software quality and enable DevOps at enterprise scale.

Data-driven leaders make stronger engineering teams


Comprehensive advice on how engineering leaders can leverage data to build trust, understand their team and move the needle.

The Principles of Chaos Engineering

Dzone - DevOps

Resilience is something those who use Kubernetes to run apps and microservices in containers aim for. When a system is resilient, it can handle losing a portion of its microservices and components without the entire system becoming inaccessible. Resilience is achieved by integrating loosely coupled microservices. When a system is resilient, microservices can be updated or taken down without having to bring the entire system down.

Engineering Impact


We curate the best technology leadership articles from around the web

What machine learning engineers need to know

O'Reilly Media - Data

This conversation grew out of a recent email thread the three of us had on machine learning engineers, a new job role that LinkedIn recently pegged as the fastest growing job in the U.S.

Microservices: The Dark Side

Speaker: Prem Chandrasekaran

A few years ago, Barclays embarked on a journey to migrate its legacy services to a modern tech stack with the objective of achieving a high level of scale, resilience, and reliability. During the incremental, iterative move towards an ecosystem of focused, distributed services, it’s fair to say that the company discovered the challenges of distributing objects. Join us for an episode you can’t miss if you’re finding it a lot harder to architect in practice than in theory, and learn how you can shed light on the dark side of microservices.

Live Site Culture & Site Reliability Engineering


That’s more than the number of engineers that they have. The Combined Engineering model adopted by the Azure DevOps Product Team at Microsoft is an example of how one group of about 650 engineers in 3 main locations around the world has evolved to be able to cope with these new realities.

Engineering Productivity Lessons From COVID-19


The world became a different place post-COVID-19 and how we work, communicate and collaborate has been redefined, possibly forever. Our company has a distributed team across the U.S. and India for development and operations. When “shelter in place” hit the entire world, we were not sure how we’d be able to execute our projects, customer […].

From Theater Production to Production Engineer at Bungie | How Bellevue Graduate Hannah Preisinger Landed Her Dream Job

Coding Dojo

Pre-Dojo: Owned a theater production company and worked in various roles at a local college. Didn’t have steady income.

The Importance of Reliability Engineering

Dzone - DevOps

Site Reliability Engineering is the defining movement in tech today. But what makes reliability engineering so important? If you’ve spent any time in tech circles lately, there are three letters you’ve surely heard: SRE. Giants like Google and Amazon market their ability to provide reliable service and startups are now investing in reliability as an early priority.

2019 State of Engineering Performance Management Report

More than 100 software leaders were asked how they manage and measure engineering performance. How do the findings stack up to your own experience? The report includes the top engineering challenges and the most used performance metrics.

Chaos Engineering and Continuous Verification in Production


Moreover, he explained how Continuous Verification can help software engineers avoid such pitfalls. But so, Casey, you are the man who literally wrote the book on chaos engineering or co-wrote the book, which is recently out from O’Reilly. So, your customers are paying you for complexity, like that’s as one way to view your job as a software engineer is you’re adding complexity to a product. It is the gold standard to do chaos engineering in production.

The Time Has Come for an Engineering Approach to DevOps


Do you want to know how to do DevOps? Are you doing DevOps but not satisfied with the results you are getting? DevOps is complex.

Four Metrics Every Engineering Manager Needs to Monitor for Success


Engineering managers and leaders are charged with a challenging task every day: How do they support engineers to do their best work, and how do they know if their engineers are doing their best work? To enable an engineer’s best work, you’ll want to make sure they are […].

Automatic Feature Engineering: An Event-Driven Approach


TL;DR: As data science evolves as a discipline, we think that it’s time to have a more systematic approach to feature engineering. That’s why we built EventsAggregator, an automatic feature engineering framework that leverages the user’s knowledge to generate expressive features.

How Top Engineering Leaders Build High-Performance Teams That Deliver Results

The primary responsibilities of Engineering Leadership - essentially, VPs of Engineering and CTOs - are building high-performance teams and delivering high-quality products on time, which together drive business results.

Engineering Inclusivity with Rich Archbold

Marcus Blankenship - Podcasts

In this episode of Programming Leadership, Marcus talks with Rich Archbold, vice president of engineering at Intercom.io, about developing an inclusion and diversity plan for his company. What Rich has been learning about diversity and inclusion in engineering teams (1:22). Rich is the Vice President of Engineering at Intercom.io, a fantastic SaaS platform which helps you connect better with your customers. Engineers are largely mathematical. Episode 43.

An Architect’s Introduction to Chaos Engineering

Dzone - DevOps

We may attempt to engineer our systems expecting certain failures only to make things worse, such as well-intended retry logic overloading an already stressed server and causing failures to cascade across the enterprise. In the old days, you had a person or one team who understood your system so well that they could engineer against most failures or immediately diagnose and fix the unexpected production failures that did slip through. Your infrastructure will fail.

How to Become a DevSecOps Engineer


One of the most important roles within DevOps, however, is the Security Engineer or DevSecOps Engineer. The popularity and success of DevOps means it’s now an increasingly common feature in IT job descriptions.

Article: How to Get Hired as a Machine Learning Engineer

InfoQ Culture Methods

To become a machine learning engineer, you have to interview. You have to gain relevant skills from books, courses, conferences, and projects. Include technologies, frameworks, and projects on your CV. In an interview, expect that you will be asked technical questions, insight questions, and programming questions.

The Product Dev Conundrum: To Build or Buy in a Digital World?

Speaker: Mark Ridley, Owner and Founder, Ridley Industries

Any PM or technical leader who’s led the charge of building a digital product knows that product engineering is one of the most expensive elements of business. In most cases, actually building software should be the last avenue that companies explore to deliver great products, as software engineering, data science and product development are complex, uncertain and hard to manage.

Chaos Engineering: What It Means, Why It Matters

The New Stack

Chaos engineering certainly evokes a lot of interest these days, especially as organizations increasingly rely on widely distributed data infrastructures that can extend across multicloud and on-premise environments — where the risk of failure grows exponentially.

Agile Product Development: Engineering Practices


Engineers take great care over quality and engineering practices while developing cutting-edge software.

Exceptional QA Engineer Skills: 6 Valuable Traits

Gorilla Logic

Have you ever considered what it means to be a tester or quality assurance (QA) engineer? If that’s all that’s involved, we might assume that a QA engineer’s skills are minimal and they don’t require any special ability, training, or knowledge. In fact, you will find testers and QA engineers who don’t have any specific training or background for their jobs. As a QA engineer at Gorilla Logic, I can tell you that this isn’t the case at all.

How Product Managers Can Learn to Love Reporting

Speaker: Eric Feinstein, Professional Services Manager, Looker

It seems like PMs and engineers have grown to hate embedded reporting. How to evaluate embedded analytic solutions as a strategy to greatly reduce initial and ongoing engineering effort.