Remove data-pipelines-are-new-anti-patterns
article thumbnail

My Thoughts on the Future of Modus Create

Modus Create

In December, we announced the news of our first strategic growth investment from JLL Partners — a major milestone that signals the beginning of a new era for Modus Create. For over a decade, we’ve helped organizations across industries adapt to the digital economy with new user-centric products, platforms, and processes.

article thumbnail

Spark Technical Debt Deep Dive

Cloudera

How Bad is Bad Code: The ROI of Fixing Broken Spark Code Once in a while I stumble upon Spark code that looks like it has been written by a Java developer and it never fails to make me wince because it is a missed opportunity to write elegant and efficient code: it is verbose, difficult to read, and full of distributed processing anti-patterns.

Lambda 57
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Kubernetes for Big Data Workloads

Abhishek Tiwari

Kubernetes has emerged as go to container orchestration platform for data engineering teams. In 2018, a widespread adaptation of Kubernetes for big data processing is anitcipated. Organisations are already using Kubernetes for a variety of workloads [1] [2] and data workloads are up next. better cluster resource utilization.

article thumbnail

Data Pipelines: The Hammer for Every Nail

Abhishek Tiwari

In the era of big data and complex data processing, data pipelines have emerged as a popular solution for managing and manipulating data. They provide a systematic approach to extract, transform, and load (ETL) data from various sources, enabling organizations to derive valuable insights.

Data 52
article thumbnail

Data Mining: use cases & benefits

Apiumhub

Over the last decade, advances in processing power and speed have enabled us to move beyond manual, tedious and time-consuming practices to quick, easy and automated data analysis. The more complex the data sets collected, the more potential there is to uncover relevant insights. What is Data Mining.

article thumbnail

Research Results: Key software architecture metrics

Apiumhub

A good place to start is to instrument existing build pipelines so you can capture the four key metrics and make the software delivery value stream visible. GoCD pipelines, for example, provide the ability to measure these four key metrics as a first-class citizen of the GoCD analytics. So, today is the day! Structural Debt Index.

Metrics 95
article thumbnail

“Why Are My Tests So Slow?” A List of Likely Suspects, Anti-Patterns, and Unresolved Personal Trauma

Honeycomb

Instrument your build pipeline with spans and traces so you can see where all your time is going. Pipeline, pipeline, pipeline tests… with care and intention. Importing test data, seeding databases, sometimes multiple times. Not using a containerized build pipeline. FIFTEEN MINUTES?” Nope, and nnnnope.

Testing 52