Podcast Episodes Missing a Transcript
Problem
A podcast platform stores one row per episode in the `episodes` DataFrame. The `transcript_score` column holds an accuracy rating once a transcript has been produced, and is missing (NaN) while an episode still has no transcript. Find every episode that has not been transcribed yet (`transcript_score` is missing). Return `episode_id`, `title`, `host` and `release_year`, ordered by `episode_id`.
Input data
Example rows — the live problem includes the full dataset.
| episode_id | title | host | release_year | transcript_score |
|---|---|---|---|---|
| 1 | Tides of Industry | Maya R. | 1925 | 4.5 |
| 2 | Quiet Engines | Leo Park | 1960 | |
| 3 | Garden of Signals | Ana Vora | 1813 | 4.8 |
| 4 | The Long Commute | D. Salim | 1951 | |
| 5 | Paper Boats | Greta Osei | 1945 | 4.2 |
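To experiment locally, the five example rows can be rebuilt as a DataFrame. This is only a sketch of the sample shown above, not the full dataset, and it assumes that an empty `transcript_score` cell corresponds to NaN, as the problem statement describes.

```python
import numpy as np
import pandas as pd

# The five example rows from the table; empty cells are entered as NaN.
episodes = pd.DataFrame(
    {
        "episode_id": [1, 2, 3, 4, 5],
        "title": [
            "Tides of Industry",
            "Quiet Engines",
            "Garden of Signals",
            "The Long Commute",
            "Paper Boats",
        ],
        "host": ["Maya R.", "Leo Park", "Ana Vora", "D. Salim", "Greta Osei"],
        "release_year": [1925, 1960, 1813, 1951, 1945],
        "transcript_score": [4.5, np.nan, 4.8, np.nan, 4.2],
    }
)

# isna() yields True for every row whose transcript_score is missing.
missing_mask = episodes["transcript_score"].isna()
print(episodes.loc[missing_mask, "episode_id"].tolist())
```

In this sample, episodes 2 and 4 are the ones without a transcript.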
Expected output
Run against the full dataset, your answer should return 3 rows with the columns `episode_id`, `title`, `host`, `release_year`. The example rows above contain only 2 of them.
Starter code (Pandas (Python))
import pandas as pd

def episodes_missing_transcript(episodes) -> pd.DataFrame:
    # Your code here
    return episodes
Solution & explanation
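One possible approach, offered as a sketch rather than the platform's reference solution: filter with `isna()`, select the four requested columns, and sort by `episode_id`. `isna()` is used because NaN never compares equal to anything, including itself, so a test such as `== float("nan")` would match no rows. The `reset_index(drop=True)` call is an assumption about what a grader might prefer; the problem statement does not mention the index.

```python
import numpy as np
import pandas as pd


def episodes_missing_transcript(episodes: pd.DataFrame) -> pd.DataFrame:
    # Keep only the rows where transcript_score is missing (NaN).
    missing = episodes[episodes["transcript_score"].isna()]
    # Select the requested columns and order the result by episode_id.
    return (
        missing[["episode_id", "title", "host", "release_year"]]
        .sort_values("episode_id")
        .reset_index(drop=True)  # assumption: a clean 0..n-1 index is acceptable
    )


# Usage on three of the example rows, deliberately out of order.
sample = pd.DataFrame(
    {
        "episode_id": [4, 1, 2],
        "title": ["The Long Commute", "Tides of Industry", "Quiet Engines"],
        "host": ["D. Salim", "Maya R.", "Leo Park"],
        "release_year": [1951, 1925, 1960],
        "transcript_score": [np.nan, 4.5, np.nan],
    }
)
result = episodes_missing_transcript(sample)
print(result)
```

On this sample the result holds episodes 2 and 4, in that order, without the `transcript_score` column.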