Is it possible to pull the auto (non-user) generated video transcripts from any of the YouTube APIs?
YouTube's automatic captions are easy to access: click the three horizontal dots (the ellipsis menu) below the video, then click the 'Open transcript' button.
YouTube offers viewers the option to download a video's subtitles as text, provided that the video's creator has already added those subtitles to their video file.
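As of Aug 2019, the following method allows you to download transcripts. Open the video page in a browser, then evaluate the expression below in the developer console to get the URL of the first caption track: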
https://www.youtube.com/watch?v=[Video ID]
JSON.parse(ytplayer.config.args.player_response).captions.playerCaptionsTracklistRenderer.captionTracks[0].baseUrl
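The console expression above fails with an error when a video has no caption tracks, and it only returns the first track. A minimal sketch of a more defensive version follows. The helper name `captionTrackUrls` and the sample data are hypothetical, and the field path is simply the one from the expression above, which YouTube may change at any time.

```javascript
// Hypothetical helper: extracts every caption track URL from a
// player_response JSON string, using the same field path as the
// console expression above.
function captionTrackUrls(playerResponseJson) {
  const response = JSON.parse(playerResponseJson);
  const renderer =
    response.captions && response.captions.playerCaptionsTracklistRenderer;
  // Fall back to an empty list when the video has no captions at all.
  const tracks = (renderer && renderer.captionTracks) || [];
  return tracks.map((track) => track.baseUrl);
}

// Made-up sample data for illustration only (not a real YouTube response).
const sample = JSON.stringify({
  captions: {
    playerCaptionsTracklistRenderer: {
      captionTracks: [{ baseUrl: "https://example.com/timedtext?lang=en" }],
    },
  },
});

console.log(captionTrackUrls(sample)); // logs the single sample URL
console.log(captionTrackUrls("{}"));   // logs an empty array
```

In the browser console, assuming the page still exposes the same object, you would pass `ytplayer.config.args.player_response` as the argument instead of `sample`.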
You may refer to this thread: How to get "transcript" in youtube-api v3
If you're authenticating with OAuth2, you could do a quick call to this feed (part of v2 of the API, which Google has since retired):
http://gdata.youtube.com/feeds/api/videos/[VIDEOID]/captiondata/[CAPTIONTRACKID]
to get the data you want. To retrieve a list of possible caption track IDs with v2 of the API, you access this feed:
https://gdata.youtube.com/feeds/api/videos/[VIDEOID]/captions
That feed request also accepts some optional parameters, including language, max-results, etc. For more details, along with a sample that shows the returned format of the caption track list, see the documentation at https://developers.google.com/youtube/2.0/developers_guide_protocol_captions#Retrieve_Caption_Set
Also, here are some references which might help: