Fundamentals of Data Engineering

Xebia

The following is a review of the book Fundamentals of Data Engineering by Joe Reis and Matt Housley, published by O’Reilly in June 2022, along with some takeaway lessons. The book is as useful for a project manager or anyone else in a non-technical role as it is for a computer science student or a data engineer.

Make the leap to Hybrid with Cloudera Data Engineering

Cloudera

When we introduced Cloudera Data Engineering (CDE) in the Public Cloud in 2020, it was the culmination of many years of working alongside companies as they deployed Apache Spark-based ETL workloads at scale. It’s no longer driven by data volumes, but by containerization, separation of storage and compute, and democratization of analytics.

How to Turn your Data Center into a True Private Cloud

Cloudera

These companies have instead opted to leverage their existing data center investment. Turning the data center into a private cloud would bring all the agility and flexibility of the public cloud to the control of an on-premises infrastructure. Move to more Data Services. Next stop: hybrid data cloud.

What is Data Engineering: Explaining Data Pipeline, Data Warehouse, and Data Engineer Role

Altexsoft

If we look at the hierarchy of needs in data science implementations, we’ll see that the next step after gathering your data for analysis is data engineering. This discipline is not to be underestimated, as it enables effective data storage and reliable data flow while taking charge of the infrastructure.

5 hot IT budget investments — and 2 going cold

CIO

New York-Presbyterian will also invest in zero trust this year, adding a security operations center (SOC) for 24/7 network monitoring as well, Fleischut says. Cold: On-prem infrastructure. As they did in 2022, many IT leaders are reducing investments in data centers and on-prem technologies. “We

The new challenges of scale: What it takes to go from PB to EB data scale

CIO

Start with storage. Before you can even think about analyzing exabytes’ worth of data, ensure you have the infrastructure to store more than 1,000 petabytes. Going from 250 PB to even a single exabyte means quadrupling storage capacity. So, what does it take for organizations to go from PB to EB scale?
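
The arithmetic behind that jump can be checked with a minimal sketch. It assumes decimal (SI) units, where 1 EB = 1,000 PB; the `growth_factor` helper is a hypothetical name introduced here for illustration and does not come from the article.

```python
# Decimal (SI) storage units, expressed in bytes (an assumption;
# binary units such as PiB and EiB would give slightly different sizes).
PB = 10**15  # petabyte
EB = 10**18  # exabyte


def growth_factor(current_bytes: int, target_bytes: int) -> float:
    """Return how many times capacity must grow to reach the target."""
    if current_bytes <= 0:
        raise ValueError("current capacity must be positive")
    return target_bytes / current_bytes


current = 250 * PB
target = 1 * EB

print(target // PB)                    # petabytes in one exabyte: 1000
print(growth_factor(current, target))  # multiplier from 250 PB to 1 EB: 4.0
```

The same helper can be reused for other starting points, for example to see how far a 50 PB estate is from a single exabyte.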

2018: A Year in Review for Storage Systems.

Hu's Place - HitachiVantara

For lack of similar capabilities, some of our competitors began implying that we would no longer be focused on the innovative data infrastructure, storage and compute solutions that were the hallmark of Hitachi Data Systems. A REST API is built directly into our VSP storage controllers.