Data collection isn’t the problem: It’s what companies are doing with it

Maxim Kharchenko

Contributor

Maxim Kharchenko is the director of fintech products at Rakuten Viber and is an expert in product leadership in the financial technology sector.

Data is a company’s most powerful asset. Yet, many businesses cannibalize this valuable asset by selling it to third parties when they should be using it to make their businesses stronger and more sustainable.

Nearly all digital businesses collect some type of data from their users, so there has been growing concern from privacy rights groups about how that data is used. Yet, data collection is not wrong in and of itself. It’s the why, how and what is done with it that matters most when it comes to building a profitable and sustainable business that simultaneously respects the privacy of its users.

In the majority of cases, there is no nefarious man behind the curtain collecting data for evil. Most companies rake in as much data as they can on the assumption that it might prove useful at some point down the line.

Thankfully, this is starting to change, and data scientists at data-driven companies are leading the charge. Collecting data for a vague hypothetical scenario signals a poor understanding of which kinds of user data actually matter. Smart companies are rightly asking only for the data they need to provide products and services to end users.

Making data work for you through AI and a data fabric

Instead of selling user data to make money, data-driven companies have opted to analyze it to extract the most useful insights. Know Your Customer (KYC) initiatives depend on data, and artificial intelligence (AI) can analyze that information to uncover preferences that users might not mention in online reviews.

Companies like Pepsi are leading the way in using AI for consumer product development purposes, and digital businesses can and should follow suit. Online platforms that want to go this route should beef up their in-house capabilities by hiring more data scientists and AI experts.

In addition to improving customer experience through better personalization and customization options, AI can help make the onboarding process for products and services smoother and more seamless.

As data becomes more complex, companies are trying to make more efficient use of their troves of data by implementing a data fabric — an interconnected layer of data and processes that supports composite data and analytics, as well as their various components.

A data fabric lets companies reuse and combine different styles of data science, enabling them to reduce integration design time by up to 30%, deployment by up to 30% and support by as much as 70%. In addition, a data fabric allows firms to use existing skills and technologies from data hubs, data lakes and data warehouses, as well as introduce new approaches and tools for the future.

Companies that want to implement a data fabric should start by integrating machine learning algorithms into every level of data — from collecting the data to optimizing and cleaning it. They should use cloud technology and implement flexible configurations, unification and fast access to data. They will also need to understand their database orchestration processes and data flows and implement the end-to-end integration of their databases.

Fintechs and banks are using data fabric to protect data by managing the access to resources while also putting in place customizable and personalized product and service offers. Lloyds Banking Group, for example, uses a data fabric to analyze customer behavior and to improve its products and services.

Retail and grocery firms today use data fabrics to improve remote customer care services by analyzing customers’ requirements and demands, as well as logistics. Apple, for one, uses data fabrics to improve its customer care service and technology offerings. Even dating apps now use data fabrics to quickly access big data.

Using decision intelligence frameworks to optimize solutions

All companies collect data to tailor their products and services to customers’ preferences and requirements. For that reason, all companies should take a long, hard look at their data collection methods and motivations.

One way to do this is through decision intelligence (DI), a discipline that spans a wide range of solutions, including traditional data analytics, artificial intelligence and complex adaptive system applications. This intelligence is applied to individual decisions as well as to sequences of decisions, which are grouped into business processes and urgent decision-making networks.

Creating such structures allows organizations to get the information they need to stimulate business. Combined with the ability to lay out an overall data structure, engineering the analysis of solutions opens up new opportunities to rethink or redesign how a company can optimize these solutions to make them more accurate, reproducible and traceable.

Intelligence in decision-making can also be used to analyze data linked to the processes on a digital platform. For example, DI can help e-commerce businesses use data to understand the best way to redesign the user journey to minimize abandoned shopping carts.
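The shopping cart example can be made concrete with a small funnel analysis. The sketch below is only an illustration of the idea, not a description of any particular DI product: the step names, the event format and the sample data are all assumptions. It counts how many users reach each checkout step and reports where the largest share of them drops out.

```python
from collections import defaultdict

# Hypothetical checkout steps, in the order a shopper passes through them.
FUNNEL_STEPS = ["view_cart", "enter_shipping", "enter_payment", "purchase"]


def funnel_dropoff(events):
    """Return (step, users_reaching_step, share_lost_before_next_step) per step.

    `events` is an iterable of (user_id, step_name) pairs.
    """
    users_by_step = defaultdict(set)
    for user_id, step in events:
        users_by_step[step].add(user_id)

    report = []
    for current, following in zip(FUNNEL_STEPS, FUNNEL_STEPS[1:]):
        reached = len(users_by_step[current])
        continued = len(users_by_step[current] & users_by_step[following])
        lost = 0.0 if reached == 0 else 1 - continued / reached
        report.append((current, reached, lost))
    return report


def worst_step(events):
    """Name the step after which the largest share of users abandons."""
    return max(funnel_dropoff(events), key=lambda row: row[2])[0]


# Made-up sample data: five shoppers, two of whom complete the purchase.
sample_events = [
    ("u1", "view_cart"), ("u1", "enter_shipping"),
    ("u1", "enter_payment"), ("u1", "purchase"),
    ("u2", "view_cart"), ("u2", "enter_shipping"),
    ("u2", "enter_payment"), ("u2", "purchase"),
    ("u3", "view_cart"), ("u3", "enter_shipping"),
    ("u4", "view_cart"), ("u4", "enter_shipping"),
    ("u5", "view_cart"),
]

for step, reached, lost in funnel_dropoff(sample_events):
    print(f"{step}: {reached} users, {lost:.0%} lost before the next step")
print("Largest drop-off after:", worst_step(sample_events))
```

A report like this only points to where users leave. Deciding how to redesign that step still takes a hypothesis and a follow-up test.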

Companies that want to use DI should have a clear vision and strategy for how the data will be used to improve products and services. They will need to hire or develop qualified data scientists and experts and to establish stable data collection and data engineering processes. Finally, DI requires a company to correlate the collected data with hypotheses related to business goals.

Many data-driven companies currently use DI in their daily business. In the banking, finance and fintech industries, DI helps institutions analyze customer behavior, predict their requirements, solve issues, and customize products and services. For example, Morgan Stanley uses DI in its fund management platform and to improve decision-making for investments.

Retailers have employed DI to make decisions on pricing policies, predict customer behavior and optimize the supply chain. Amazon, for instance, uses both data fabrics and DI to optimize its supply chain. In healthcare, DI allows practitioners to analyze medical reports faster and enables doctors to more easily prioritize successful treatments.

Recognizing and predicting trends requires strategic data collection

Ideally, data-driven businesses should not be strategizing more than a year ahead; they should instead be analyzing and utilizing the collected data on a continuous basis. If this is baked into the workflow, changes to user wants and needs will be reflected in the data, and smart digital platforms will be able to see if they need to start pivoting or adjusting their offerings to users.

Companies need to be strategic in determining how much and what kind of data they collect. When companies collect too much data, much of it is irrelevant, and the excess makes the relevant data harder to find. Therefore, developers should be key internal stakeholders in evaluating the quality of the data being collected for analysis.
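One lightweight way to keep collection strategic is to record, for every field a product collects, which hypothesis or feature it serves. The sketch below assumes such an inventory exists; the field names and hypotheses are invented for illustration. Fields with no stated purpose are flagged as candidates to stop collecting.

```python
# Hypothetical inventory: each collected field mapped to the hypotheses it supports.
FIELD_PURPOSES = {
    "email": ["H1: reminder emails reduce churn"],
    "cart_contents": ["H2: shorter checkout reduces abandonment"],
    "device_model": [],
    "contact_list": [],
}


def audit_fields(field_purposes):
    """Split collected fields into those tied to a hypothesis and those that are not."""
    justified = sorted(f for f, purposes in field_purposes.items() if purposes)
    unjustified = sorted(f for f, purposes in field_purposes.items() if not purposes)
    return justified, unjustified


justified, unjustified = audit_fields(FIELD_PURPOSES)
print("Keep collecting:", justified)
print("Candidates to stop collecting:", unjustified)
```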

When a business is conducting product-market fit research, the collected data should help to prove or disprove particular hypotheses that the company is testing, and it is up to the product and business developers to drive the data requests according to their hypotheses.
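As a rough illustration of proving or disproving a hypothesis with collected data, the sketch below compares conversion rates between two groups of users using a standard two-proportion z-test. The scenario and the numbers are made up, and the normal approximation it relies on is only reasonable when both groups are fairly large.

```python
import math


def two_proportion_z_test(successes_a, total_a, successes_b, total_b):
    """Two-sided z-test for a difference between two conversion rates.

    Returns (z, p_value), using the pooled-proportion normal approximation.
    """
    rate_a = successes_a / total_a
    rate_b = successes_b / total_b
    pooled = (successes_a + successes_b) / (total_a + total_b)
    std_err = math.sqrt(pooled * (1 - pooled) * (1 / total_a + 1 / total_b))
    if std_err == 0:
        return 0.0, 1.0  # no variation at all, so no evidence of a difference
    z = (rate_b - rate_a) / std_err
    p_value = math.erfc(abs(z) / math.sqrt(2))  # two-sided tail probability
    return z, p_value


# Made-up example: did a redesigned onboarding flow change the sign-up rate?
z, p_value = two_proportion_z_test(
    successes_a=120, total_a=1000,  # current flow
    successes_b=165, total_b=1000,  # redesigned flow
)
print(f"z = {z:.2f}, p = {p_value:.4f}")
```

A small p-value suggests the difference is unlikely to be chance alone. It does not say whether the change is large enough to matter to the business, which remains a judgment for the product team.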

To further optimize the data collection process, companies should avoid building hypotheses and data collection methods based on the experience of their competitors. Determining product-market fit can only be accomplished when the data being collected specifically relates to the company’s own products and user base. Ultimately, ready-to-go tools (such as Power BI, Tableau or AutoML), combined with data scientists’ skills in Python, C# or MATLAB, will help the company use its data to make the best decisions.

Building a data-driven business without sacrificing user trust

Invading user privacy by collecting data just to sell it is an unimaginative waste of time and business intelligence, and it can irreparably damage a company’s relationship with its customers. Smart businesses are establishing relationships with users built on trust — trust that the data users are handing over will ultimately benefit them in the form of features and services that meet their ever-changing needs.

Building a data-driven digital business that respects user privacy doesn’t mean sacrificing profits. On the contrary, satisfying users by fully understanding their needs — through strategic data collection and the use of AI and decision intelligence — is the only way to ensure long-term profitability.
