Deep Science: AI adventures in arts and letters

There’s more AI news out there than anyone can possibly keep up with. But you can stay tolerably up to date on the most interesting developments with this column, which collects AI and machine learning advancements from around the world and explains why they might be important to tech, startups or civilization.

To begin on a lighthearted note: The ways researchers find to apply machine learning to the arts are always interesting — though not always practical. A team from the University of Washington wanted to see if a computer vision system could learn to tell what is being played on a piano just from an overhead view of the keys and the player’s hands.

Audeo, the system trained by Eli Shlizerman, Kun Su and Xiulong Liu, watches video of piano playing and first extracts a simple, piano-roll-like sequence of key presses. Then it adds expression in the form of the length and strength of the presses, and lastly polishes the result for input into a MIDI synthesizer, which produces the audio. The results are a little loose but definitely recognizable.
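To make the piano-roll idea concrete, here is a minimal sketch of how per-frame key presses might be collapsed into notes with a start and a length, which is roughly the shape of data a MIDI synthesizer expects. This is not Audeo's code: the frame representation, the `roll_to_notes` name and the 88-key default are all assumptions made for illustration, and the hard part (working out which keys are pressed from video) is taken as given.

```python
from typing import Dict, List, Tuple

# Each frame lists the key indices judged "pressed" in one video frame.
# (Hypothetical representation; the research is only described as "piano-roll-like".)
Frame = List[int]
Note = Tuple[int, int, int]  # (key, start_frame, length_in_frames)


def roll_to_notes(frames: List[Frame], n_keys: int = 88) -> List[Note]:
    """Collapse per-frame key presses into notes with a start and a length."""
    notes: List[Note] = []
    started: Dict[int, int] = {}  # key -> frame index where the current press began
    # An empty sentinel frame at the end closes any notes still held down.
    for t, pressed in enumerate(frames + [[]]):
        down = set(pressed)
        for key in list(started):
            if key not in down:  # key was released on this frame
                start = started.pop(key)
                notes.append((key, start, t - start))
        for key in down:
            if 0 <= key < n_keys and key not in started:
                started[key] = t  # key is newly pressed
    return sorted(notes, key=lambda n: (n[1], n[0]))


frames = [[39], [39, 43], [43], [], [46]]
print(roll_to_notes(frames))
# prints [(39, 0, 2), (43, 1, 2), (46, 4, 1)]
```

The "strength" of each press, which the researchers also recover, would be one more field on each note; it is left out here to keep the sketch short.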

Diagram showing how video of a piano player's hands on the keys is turned into MIDI sequences.
Image Credits: Shlizerman et al.

“To create music that sounds like it could be played in a musical performance was previously believed to be impossible,” said Shlizerman. “An algorithm needs to figure out the cues, or ‘features,’ in the video frames that are related to generating music, and it needs to ‘imagine’ the sound that’s happening in between the video frames. It requires a system that is both precise and imaginative. The fact that we achieved music that sounded pretty good was a surprise.”

Also from the field of arts and letters is this fascinating research into the computational unfolding of ancient letters too delicate to handle. The MIT team was looking at “locked” letters from the 17th century that are so intricately folded and sealed that opening and flattening them might permanently damage them. Their approach was to X-ray the letters and set a new, advanced algorithm to work deciphering the resulting imagery.

Diagram showing X-ray views of a letter and how it is analyzed to virtually unfold it. Image Credits: MIT

“The algorithm ends up doing an impressive job at separating the layers of paper, despite their extreme thinness and tiny gaps between them, sometimes less than the resolution of the scan,” MIT’s Erik Demaine said. “We weren’t sure it would be possible.” The work may be applicable to many kinds of documents that are difficult for simple X-ray techniques to unravel. It’s a bit of a stretch to categorize this as “machine learning,” but it was too interesting not to include. Read the full paper at Nature Communications.

Diagram showing how reviews of electric car charge points are analyzed and turned into useful data.
Image Credits: Asensio et al.

You arrive at a charge point for your electric car and find it to be out of service. You might even leave a bad review online. In fact, thousands of such reviews exist and constitute a potentially very useful map for municipalities looking to expand electric vehicle infrastructure.

Georgia Tech’s Omar Asensio trained a natural language processing model on such reviews, and it soon became adept at parsing them by the thousands and extracting insights such as where outages are common, how costs compare and other factors.

“Given the massive investment in electric vehicle infrastructure, we’re doing it in a way that is not necessarily attentive to the social equity and distributional issues of access to this enabling infrastructure,” said Asensio. Where better to study those issues than in unsolicited feedback from the people most affected?

Sudden interruptions to service, power or other necessaries can also lead to drones falling out of the air. The more built-in safeties we have in such circumstances, the better — and they can’t rely on anything like control signals or GPS. University of Zurich researchers have shown that a damaged drone with nothing more than a camera and working CPU can retain a pretty good amount of control.

A busted rotor from a collision or mechanical problem can cause a quadcopter to spin wildly and crash. But the Swiss team, led by Sihao Sun, showed that an onboard camera can perform very quick analysis of its surroundings as it spins, estimating its position based on the surroundings zooming by its view.

IEEE Spectrum has more information and an interview with Sun if you want to learn more (and why wouldn’t you?).

The ability to analyze imagery fast is a common asset in today’s AI systems. This has come into play in the medical imaging industry as well, where examinations and instruments can produce more images than a single doctor or even several specialists can be expected to scrutinize closely in a short time frame.

Echocardiograms, or ultrasound images of the heart, are no exception. A single session can yield thousands of images, any one of which might provide the clear image needed by a doctor to form a good idea of the heart’s condition. A team at Geisinger Research showed that AI can help sort through these images and aid doctors in the process of diagnosis and prognosis. The paper published in Nature Biomedical Engineering showed that doctors assisted by the system were 13% more accurate in predicting mortality.

The enormous (nearly 50 million images total) dataset on which it was trained will likely lead to more advances — the discovery here is of the possibility of using an unstructured imagery database like this one to produce decision-aiding AI, not the limits of those possibilities.

The problem of dealing with large amounts of data is mitigated somewhat when that data can be checked by humans. For instance, an image recognition algorithm that sorts pictures of cats from those of dogs can easily have its results audited by a human because we all know what dogs and cats look like.

But what if the neural network is looking at something humans don’t intuitively understand — like DNA sequences? It’s hard to say whether a system works well if the people making it aren’t confident about their ability to monitor it.

Peter Koo and Matt Ploenzke at Cold Spring Harbor Laboratory looked into ways of making machine learning systems that analyze genomic sequences a bit more transparent to humans. Their approach involves strongly training one layer of the convolutional neural network with known, familiar patterns so that the network uses those as points of reference in its later analysis. These improvements to interpretability seem to be independent of overall model effectiveness, so Koo speculates that if designed right, there should be no real trade-off.
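Why would a layer tied to familiar patterns be easier to read? A toy sketch may help. A first-layer convolution filter over DNA is essentially a small table of weights slid along the sequence, so a filter that matches a known motif can be inspected directly. The code below is not the researchers' method: the `motif_filter` and `scan` helpers, the hand-set weights and the example sequence are all assumptions chosen purely for illustration.

```python
from typing import List

BASES = "ACGT"


def one_hot(seq: str) -> List[List[int]]:
    """Encode a DNA string as one row of four 0/1 values per base."""
    return [[1 if base == b else 0 for b in BASES] for base in seq]


def motif_filter(motif: str) -> List[List[float]]:
    """Build a filter that scores +1 per matching position (a toy stand-in for learned weights)."""
    return [[1.0 if base == b else 0.0 for b in BASES] for base in motif]


def scan(seq: str, filt: List[List[float]]) -> List[float]:
    """Slide the filter along the sequence, as a 1D convolution layer would."""
    x = one_hot(seq)
    width = len(filt)
    scores = []
    for start in range(len(x) - width + 1):
        window = x[start:start + width]
        score = sum(w * v for wrow, xrow in zip(filt, window) for w, v in zip(wrow, xrow))
        scores.append(score)
    return scores


# Scan a made-up sequence for the well-known TATA-box-like pattern "TATAAA".
scores = scan("GGTATAAAGC", motif_filter("TATAAA"))
best = max(range(len(scores)), key=scores.__getitem__)
print(best, scores[best])
# prints 2 6.0
```

Because the filter's weights spell out a pattern a biologist already knows, a high score at position 2 has an obvious meaning, which is the kind of reference point the paragraph above describes.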

Back to the arts and letters theme. Interpretability is as important when AIs make mistakes as when they get good results. A strange case appeared recently as CMU researchers showed that YouTube and other major implementers of natural language processing may be mistakenly flagging some chatter as inappropriate due to a misunderstanding of chess terminology.

Close-Up Of Chess Pieces On White Background
Image Credits: Ahmad Hairi Mohamed/EyeEm / Getty Images

It suddenly seems obvious when you consider that chess is often discussed in terms of white versus black, and in certain constructions — “white made a savage attack on black and drove him back” or the like — a computer with no real understanding of these terms might think something is awry.

This matters not just so people can discuss chess freely, but because companies like YouTube need to be able to understand, and explain to their users, why their increasingly AI-powered moderation processes make the decisions they do.
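The chess mix-up is easy to reproduce with a deliberately naive filter. The sketch below is not how YouTube or any real moderation system works; the word lists, function names and example comment are all hypothetical. It only shows how a rule with no grasp of context trips over ordinary chess commentary, and how fragile a bolt-on fix can be.

```python
import re
from typing import Set

# Hypothetical word lists for illustration only; not taken from any real moderation system.
GROUP_TERMS: Set[str] = {"white", "black"}
HOSTILE_TERMS: Set[str] = {"attack", "threat", "kill", "destroy"}
CHESS_TERMS: Set[str] = {"pawn", "bishop", "knight", "rook", "queen", "checkmate", "opening"}


def tokens(text: str) -> Set[str]:
    """Lowercase the text and return its distinct alphabetic words."""
    return set(re.findall(r"[a-z]+", text.lower()))


def naive_flag(text: str) -> bool:
    """Flag text that mentions a group term together with a hostile term."""
    words = tokens(text)
    return bool(words & GROUP_TERMS) and bool(words & HOSTILE_TERMS)


def context_aware_flag(text: str) -> bool:
    """Same rule, but stand down when chess vocabulary is present."""
    return naive_flag(text) and not (tokens(text) & CHESS_TERMS)


comment = "White made a savage attack on black and won a pawn."
print(naive_flag(comment), context_aware_flag(comment))
# prints True False
```

Note that the second rule only helps because the made-up comment happens to mention a pawn. A chess comment with no piece names would still be flagged, which is why explaining these decisions is harder than patching them.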

Finally, an experiment along those lines showcases the limits of AI understanding. The famous example of an image analysis system mistaking grass for sheep is a valuable lesson, but another way to see what an AI pays attention to is simply to delete more and more of an image and check whether the system can still recognize it.
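The delete-and-check loop itself is simple to sketch. In the toy version below, `toy_score` is a hypothetical stand-in for a real image classifier (it just measures how much of the picture is bright), and pixels are deleted in plain reading order; how the actual project chooses what to remove is not described here, so treat every detail as an assumption.

```python
from typing import Callable, List, Optional

Image = List[List[Optional[int]]]  # grayscale values; None marks a deleted pixel


def toy_score(img: Image) -> float:
    """Hypothetical stand-in for a classifier: fraction of pixels that are bright (>200)."""
    pixels = [p for row in img for p in row]
    return sum(1 for p in pixels if p is not None and p > 200) / len(pixels)


def delete_until_unrecognized(
    img: Image, score: Callable[[Image], float], threshold: float
) -> int:
    """Delete pixels in place, in reading order, until the score drops below the threshold.

    Returns the number of deletions it took.
    """
    deleted = 0
    for r, row in enumerate(img):
        for c in range(len(row)):
            if score(img) < threshold:
                return deleted
            img[r][c] = None
            deleted += 1
    return deleted


# A tiny made-up "landscape": bright sky on top, darker ground below.
image: Image = [[255, 255, 255, 255], [250, 240, 90, 80], [60, 50, 40, 30]]
steps = delete_until_unrecognized(image, toy_score, threshold=0.25)
print(steps)
# prints 4
```

After four deletions the toy recognizer gives up, yet two of the three rows are untouched, a small-scale version of the gap between what the machine needs and what a person can still see.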

Animated image showing parts of a landscape being deleted until an AI no longer recognizes it.
Image Credits: Shinseungback Kimyonghun

This is more of an art project than science (indeed it is by Korean artists Shinseungback Kimyonghun), but the implications are very interesting. When you’ve taken away everything that an AI recognizes about a scene, what is left? In some cases, the landscape is, in a way, almost as clear to the human eye as it was to start with. It’s a reminder of how different our mode of perception is from the machine learning systems we’ve created to ape it.
