Windows, MacOS or Linux workflow would be fine, I am result-oriented.
Problem:
ffmeg -i <filename> sees the stream:
[....]
Stream #0:0[0x100]: Video: mpeg2video (Main) ([2][0][0][0] / 0x0002), yuv420p(tv, top first), 1440x1080 [SAR 4:3 DAR 16:9], 29.97 fps, 29.97 tbr, 90k tbn, 59.94 tbc
Stream #0:1[0x110]: Audio: aac (LC) ([15][0][0][0] / 0x000F), 48000 Hz, stereo, fltp, 255 kb/s
Stream #0:2[0x130]: Subtitle: arib_caption (Profile A) ([6][0][0][0] / 0x0006)
Stream #0:3[0x138]: Data: bin_data ([6][0][0][0] / 0x0006)
Stream #0:4[0x140]: Unknown: none ([13][0][0][0] / 0x000D)
Stream #0:5[0x160]: Unknown: none ([13][0][0][0] / 0x000D)
Stream #0:6[0x161]: Unknown: none ([13][0][0][0] / 0x000D)
Stream #0:7[0x162]: Unknown: none ([13][0][0][0] / 0x000D)
Stream #0:8[0x163]: Unknown: none ([13][0][0][0] / 0x000D)
Stream #0:9[0x164]: Unknown: none ([13][0][0][0] / 0x000D)
Stream #0:10[0x165]: Unknown: none ([13][0][0][0] / 0x000D)
Stream #0:11[0x111]: Audio: aac ([15][0][0][0] / 0x000F), 0 channels
[...]
and I can attempt to extract it using something like:
ffmpeg -y -i source.m2t -map 0: -vn -an -scodec copy -dn -ignore_unknown -f mpegts output.ts
however having poked around at the result with a hex editor, and tried to open the result as SHIFT-JIS and UTF-8, it is not obvious to me how to convert it to something readable.
ccextractor is not that useful either (or, more likely, I don't know how to use it):
user@host:~$ ccextractor source.m2t
CCExtractor 0.88, Carlos Fernandez Sanz, Volker Quetschke.
Teletext portions taken from Petr Kutalek's telxcc
--------------------------------------------------------------------------
Input: source.m2t
[Extract: 1] [Stream mode: Autodetect]
[Program : Auto ] [Hauppage mode: No] [Use MythTV code: Auto]
[Timing mode: Auto] [Debug: No] [Buffer input: No]
[Use pic_order_cnt_lsb for H.264: No] [Print CC decoder traces: No]
[Target format: .srt] [Encoding: UTF-8] [Delay: 0] [Trim lines: No]
[Add font color data: Yes] [Add font typesetting: Yes]
[Convert case: No] [Video-edit join: No]
[Extraction start time: not set (from start)]
[Extraction end time: not set (to end)]
[Live stream: No] [Clock frequency: 90000]
[Teletext page: Autodetect]
[Start credits text: None]
[Quantisation-mode: CCExtractor's internal function]
-----------------------------------------------------------------
Opening file: source.m2t
Detected MP4 box with name: meta
File seems to be a transport stream, enabling TS mode
Analyzing data in general mode
This TS file has more than one program. These are the program numbers found:
191
791
792
*****ISDB subtitles detected
*****ISDB subtitles detected
*****ISDB subtitles detected
*****ISDB subtitles detected
1% | 00:10*****ISDB subtitles detected
*****ISDB subtitles detected
84% | 25:35*****ISDB subtitles detected
*****ISDB subtitles detected
99% | 30:10*****ISDB subtitles detected
*****ISDB subtitles detected
100% | 30:25
Number of NAL_type_7: 0
Number of VCL_HRD: 0
Number of NAL HRD: 0
Number of jump-in-frames: 0
Number of num_unexpected_sei_length: 0
Min PTS: 15:46:36:674
Max PTS: 16:17:01:680
Length: 00:30:25:006
Done, processing time = 78 seconds
No captions were found in input.
Issues? Open a ticket here
Code: Select all
https://github.com/CCExtractor/ccextractor/issues
[/font]
Looking for "*****ISDB subtitles detected" I came across
Code: Select all
https://github.com/stz2012/libarib25/