
MergeStat channels open source and SQL to bring ‘operational analytics’ to software engineering



A new open source startup is setting out to help software development teams glean deeper insights from their codebases, using SQL to query all the data sources they use in the software building process.

MergeStat, as the startup is known, has flown under the radar until now, but with plans to launch a commercial product on top of its existing open source project, the company today announced a $1.2 million pre-seed round of funding and gave some insights on where it is and where it’s going in the months ahead.

For context, MergeStat’s origins can be traced back to mid-2020, when the first commits were made to a project called Gitqlite, essentially an experiment that brought together SQLite and Git to make it easier to query historical data in code repositories.

“At the time, I was very interested in exploring the history of source code to learn about legacy codebases I was working in,” MergeStat founder and CEO Patrick DeVivo explained to TechCrunch. “Could Git history be used to determine the best people to contact for questions about certain features or parts of a codebase? As a way to identify ‘experts’ in certain areas of code, and provide aggregated context around who was responsible for what parts of the source code? Similarly, could it surface areas of high risk that were dependent on someone who no longer works on a project?”

In essence, it’s all about diving into code history. This includes querying basic elements such as commit history and displaying author metadata via the “Git blame” command, but the project’s intention is to go far beyond this and enable developers to use SQL to ask questions about the code itself.
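
To make the idea concrete, here is a minimal sketch of the kind of “who knows this code?” question DeVivo describes, expressed as SQL over commit history. It uses Python’s built-in sqlite3 module with an in-memory database. The `commit_files` table, its columns and the sample rows are assumptions made up for illustration, not MergeStat’s or Gitqlite’s actual schema.

```python
import sqlite3


def top_authors(conn, path_prefix, limit=3):
    """Rank authors by the number of distinct commits touching files under path_prefix."""
    return conn.execute(
        """
        SELECT author, COUNT(DISTINCT commit_hash) AS commits
        FROM commit_files
        WHERE file_path LIKE ? || '%'
        GROUP BY author
        ORDER BY commits DESC, author ASC
        LIMIT ?
        """,
        (path_prefix, limit),
    ).fetchall()


# Hypothetical, hand-written history: one row per (commit, file) pair.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE commit_files (commit_hash TEXT, author TEXT, file_path TEXT)"
)
conn.executemany(
    "INSERT INTO commit_files VALUES (?, ?, ?)",
    [
        ("a1", "alice", "billing/invoice.py"),
        ("a2", "alice", "billing/tax.py"),
        ("b1", "bob", "billing/invoice.py"),
        ("b2", "bob", "web/app.py"),
        ("c1", "carol", "web/app.py"),
    ],
)

experts = top_authors(conn, "billing/")
print(experts)
```

In this toy data set, the author with the most commits under `billing/` comes first. A real analysis would likely also weigh how recent the commits are, since the article notes the risk of depending on someone who no longer works on a project.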

“Operational analytics”

Fast-forward to April 2021, and the commercial MergeStat company was officially born, with DeVivo going on to lure Josue Lopez from cloud giant Equinix to serve as chief operating officer (COO), as well as official co-founder.

“This has led us to where we are today, where our mission is to support operational analytics for software engineering teams,” DeVivo said. “If it’s involved in building or shipping software, we’d like to make it possible to query with SQL.”

Essentially, any tool that works with PostgreSQL — including most business intelligence (BI) and data visualization tools — works with MergeStat. The platform itself includes a management interface and a PostgreSQL database, with MergeStat synchronizing data from various software development lifecycle (SDLC) sources into the main PostgreSQL database. Users can then query that data from within MergeStat’s app, or connect it to a third-party tool such as Grafana, Tableau or Superset.

But what are the kinds of use cases that MergeStat might support? Well, at its core it’s about garnering insights from information that may be spread across different codebases and developer teams. For example, if a manager at a large enterprise wants to know how many teams — and which teams — have adopted a new tool, or how many codebases use a specific version of a programming language or library, they can use MergeStat to ask that. Alternatively, they might want to extract all the third-party dependencies or configuration file values, and again MergeStat could help here.

Knowing the answers to such questions is vitally important if a company is conducting a huge migration project, or if it is figuring out its potential attack surface where there is a known vulnerability in a particular dependency.
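
As a rough illustration of the dependency question, the sketch below asks which teams and repositories pin a given version of a library. It again uses Python’s built-in sqlite3 module rather than PostgreSQL so that it runs on its own. The `repo_dependencies` table, the package name `examplelib` and all rows are hypothetical and do not reflect MergeStat’s real schema.

```python
import sqlite3


def repos_on_version(conn, package, version):
    """Return (team, repo) pairs that declare `package` at exactly `version`."""
    return conn.execute(
        """
        SELECT team, repo
        FROM repo_dependencies
        WHERE package = ? AND version = ?
        ORDER BY team, repo
        """,
        (package, version),
    ).fetchall()


# Hypothetical inventory: one row per dependency declared in a repository.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE repo_dependencies "
    "(repo TEXT, team TEXT, package TEXT, version TEXT)"
)
conn.executemany(
    "INSERT INTO repo_dependencies VALUES (?, ?, ?, ?)",
    [
        ("checkout-api", "payments", "examplelib", "1.4.2"),
        ("ledger", "payments", "examplelib", "1.5.0"),
        ("search-indexer", "platform", "examplelib", "1.4.2"),
        ("search-indexer", "platform", "otherlib", "3.0.1"),
    ],
)

affected = repos_on_version(conn, "examplelib", "1.4.2")
print(affected)
```

If version 1.4.2 of the made-up `examplelib` were the one with a known vulnerability, the result would be the list of repositories to patch and the teams to notify.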

MergeStat in action. Image Credits: MergeStat

Other potential use cases include auditing and compliance, so that companies can follow proper procedures and best practices as part of a regulatory framework. For example, a service provider might need to demonstrate that they are properly managing their customers’ data as part of a SOC 2 audit — MergeStat can be used to gather and present this evidence, showing who has accessed a specific file or who has modified what code.

Competitive landscape

It’s worth pointing out here that it’s already possible to get answers to these questions, but this typically means a manual, resource-intensive process that spans multiple screens and tools and involves copying text into spreadsheets. MergeStat automates much of it by allowing engineers to ask questions via SQL and view answers in dashboards, reports and alerts through BI tools.

“MergeStat can continuously answer these questions, as teams go about their normal work — the underlying data MergeStat accesses changes to reflect the updated state,” DeVivo added.

Example pull request (PR) data derived via MergeStat. Image Credits: MergeStat

There are also many SaaS tools out there that fulfill at least one segment of what MergeStat promises. For instance, engineering metrics are covered by the likes of LinearB or Jellyfish, while code search is a core component of Sourcegraph and GitHub itself. And in the audit and compliance sphere, there are Drata, Vanta and Laika, which integrate with GitHub for evidence gathering.

While these all bring value, MergeStat is betting that many engineering leads don’t want pre-built “canned” metrics and charts around subjective concepts such as “velocity” or “productivity.” MergeStat posits that many would prefer access to the underlying data across the software development lifecycle, with the flexibility to query it in ways that are relevant to their specific organization and use case.

“Every organization is different, and we believe giving them tools to work with their data, to craft more specific questions, leads to better outcomes,” DeVivo said. “We are positioning ourselves as a data infrastructure product and believe that giving ‘lower level’ access to the data involved in building and shipping software is generally useful for engineering organizations to operationalize it.”

Being open source, of course, is also a big part of MergeStat’s flexibility promise. It gives companies full control of their data and deployment, while they are able to slice-and-dice it however they see fit — locally on a laptop, if they like — to figure it all out before going all-in.

What’s next

While MergeStat is still pretty much an open source project for now, the company is currently working on a hosted cloud product and an enterprise-focused incarnation that can be self-hosted or deployed on any cloud of the customer’s choosing. Much of this will be built around its recently announced “PostgreSQL approach,” which involves synchronizing data into a Postgres database for powering queries further downstream.

In the buildup to its commercial launch, MergeStat said that it’s already working with “a number of companies” in early tests, including the team at Equinix Metal, which DeVivo says is currently using a self-hosted MergeStat instance across 800 repositories.

MergeStat’s pre-seed round was led by OSS Capital, with participation from Caffeinated Capital and a slew of angel investors.
