How FiveStars re-engineered its data engineering stack

CIO

It shows in his reluctance to run his own servers, but it’s perhaps most obvious in his attitude to data engineering, where he’s nearing the end of a five-year journey to automate or outsource much of the mundane maintenance work and focus internal resources on data analysis. “They wrote bash scripts!”

Fundamentals of Data Engineering

Xebia

The following is a review of the book Fundamentals of Data Engineering by Joe Reis and Matt Housley, published by O’Reilly in June of 2022, and some takeaway lessons. This book is as good for a project manager or any other non-technical role as it is for a computer science student or a data engineer.

Trending Sources

NJ Transit creates ‘data engine’ to fuel transformation

CIO

Collectively, the agencies also have pilots up and running to test electric buses and IoT sensors scattered throughout the transportation system. ‘Data engine on wheels’. To mine more data out of a dated infrastructure, Fazal first had to modernize NJ Transit’s stack from the ground up to be geared for business benefit.

Data engineers vs. data scientists

O'Reilly Media - Data

It’s important to understand the differences between a data engineer and a data scientist. Misunderstanding or not knowing these differences is causing teams to fail or underperform with big data. I think some of these misconceptions come from the diagrams that are used to describe data scientists and data engineers.

Optimizing Cloudera Data Engineering Autoscaling Performance

Cloudera

At Cloudera, we introduced Cloudera Data Engineering (CDE) as part of our Enterprise Data Cloud product, Cloudera Data Platform (CDP), to meet these challenges. YuniKorn’s gang scheduling and bin-packing help boost autoscaling performance and improve resource utilization.

Analytics operating system Redbird makes data more accessible to non-technical users

TechCrunch

Data engineers have a big problem. Almost every team in their business needs access to analytics and other information that can be gleaned from their data warehouses, but only a few have technical backgrounds.

DTN’s CTO on combining IT systems after a merger

CIO

Over the years, DTN has bought up several niche data service providers, each with its own IT systems, an environment that challenged DTN IT’s ability to innovate. “Very little innovation was happening because most of the energy was going towards having those five systems run in parallel.” The merger playbook.