Podcast Episodes Missing a Transcript

PandasEasyJunior level~10 min

Problem

A podcast platform stores one row per episode in the `episodes` DataFrame. The `transcript_score` column holds an accuracy rating once a transcript has been produced, and is missing (NaN) while an episode still has no transcript. Find every episode that has not been transcribed yet (`transcript_score` is missing). Return `episode_id`, `title`, `host` and `release_year`, ordered by `episode_id`.

Input data

Example rows — the live problem includes the full dataset.

episodes

episode_id	title	host	release_year	transcript_score
1	Tides of Industry	Maya R.	1925	4.5
2	Quiet Engines	Leo Park	1960
3	Garden of Signals	Ana Vora	1813	4.8
4	The Long Commute	D. Salim	1951
5	Paper Boats	Greta Osei	1945	4.2

Expected output

Your answer should return 3 rows with the columns episode_id, title, host, release_year.

Starter code (Pandas (Python))

import pandas as pd

def episodes_missing_transcript(episodes) -> pd.DataFrame:
    # Your code here
    return episodes

Solve this Pandas question free

Write Pandas (Python) and run it instantly in your browser — even on your phone. No signup needed to try.

Solve it now → Create free account

Solution & explanation

Create a free account to unlock the optimal solution, a step-by-step explanation, and the hidden test cases that grade your answer.