
ccextractor -> Version History

This is the version history page for ccextractor, showing the last 20 software updates.


Version 0.88
  • Release Date: May 22, 2019
  • Download(s):
    • New: More tapping points for debug image in ccextractor.
    • New: Add support for tesseract 4.0
    • Optimize: Remove multiple RGB to grey conversion in OCR.
    • Fix: Update UTF8Proc to 2.2.0
    • Fix: Update LibPNG to 1.6.35
    • Fix: Update Protobuf-c to 1.3.1
    • Fix: Warn instead of fatal when a 0xFF marker is missing
    • Fix: Segfault in general_loop.c due to null pointer dereference (case of no encoder)
    • Fix: Enable printing hdtv stats to console.
    • Fix: Many typos in comments and output messages
    • Fix: Ignore Visual Studio temporary project files
    • New: Add support for non-Latin characters in stdout
    • Fix: Check whether stream is empty
    • New: Add support for EIA-608 inside .mkv
    • New: Add support for DVB inside .mkv
    • Fix: Added -latrusmap: map Latin symbols to Cyrillic ones in special cases of Russian Teletext files (issue #1086)
    • Fix: Several OCR crashes
Version 0.87
  • Release Date: Jan 21, 2019
  • Download(s):
  • 0.87 (2018-10-23)
    New: Upgrade libGPAC to 0.7.1.
    New: mp4 tx3g & multitrack subtitles.
    New: Guide to update dependencies (docs/Updating_Dependencies.txt).
    New: Add LICENSE File (#959).
    New: Display quantisation mode in info box (#954).
    New: Add instruction required to build ccextractor with HARDSUBX support (#946).
    New: Added version no. of libraries to --version.
    New: Added -quant (OCR quantization function).
    New: Python API now compatible with Python 3.
    Fix: linux/builddebug: Added non-local directories to the include search path so we don't
    require a locally compiled tesseract or leptonica.
    Fix: Correct -HARDSUBX Bug In CMake, allow build with hardsubx using cmake (#966).
    Fix: possible segfaults in hardsubx_classifier.c due to strdup (#963).
    Fix: Improve the start and end timestamps of extracted burned in captions (#962).
    Fix: Update (#960).
    Fix: Fixed crash with "-out=report" and "-out=null".
    Fix: -nocf not working with OCR'ing (#958).
    Fix: segfault in add_cc_sub_text and initialize to NULL in init_encoder (#950).
    Fix: ccx_decoders_common.c: Copy data type when creating a copy of the subtitle structure.
    Fix: Implicit declaration of these functions throws warning during build (#948).
    Fix: ccx_decoders_common.c: Properly release allocated resources on free_subtitle().
    Fix: Added a datatype member to struct cc_subtitle - needed so we can properly free all
    memory when void *data points to a structure that has its own pointers.
    Fix: dvb_subtitle_decoder.c: When combining image regions verify that the offset is
    never negative.
    Fix: Updated travis.yml to fix osx build (#947).
    Fix: Add utf8proc src file to cmake, updated header file (#944).
    Fix: Added required pointers on freep() calls.
    Fix: Removed dvb_debug_traces_to_stdout and used the usual dbg_print instead.
    Fix: Additional debug traces for DVB.
    Fix: Fix minor memory leak in ocr.c.
    Fix: Fix issue with displaying utf8proc version.
    Fix: Fix failing cmake due to liblept/tesseract header files.
    Fix: Added missing n in params.c.
    Fix: builddebug: Use -fsanitize=address -fno-omit-frame-pointer.
    Fix: ccx_decoders_common.c: Removed trivial memory leak.
    Fix: ccx_encoders_srt.c: Made sure a pointer is non-NULL before dereferencing.
    Fix: dvb_subtitle_decoder.c: Initialize pointer members to NULL when creating a structure.
    Fix: lib_ccx.c: Initialize (memset 0) structure cc_subtitle after memory allocation.
    Fix: Added verboseness to error/warnings in dvb_subtitle_decoder.c.
    Fix: dvb_subtitle_decoder.c: Work on passing invalid streams errors upstream (plus some
    warning messages) so we can eventually recover from this situation instead of crashing.
    Fix: telxcc.c: Currently setting a colour doesn't necessarily add a space even though the
    specifications mandate it. (#930).
    Fix: dvb_subtitle_decoder.c: Fix null pointer dereference when region==NULL in write_dvb_sub.
    Fix: DVB Teletext subtitle incomplete.
    Fix: replace all 0xA characters within startbox with 0x20.
    Fix: DVB Teletext subtitle incomplete (#922).
    Fix: Add missing return value to one of the returns in process_tx3g().
    Fix: Typos and other minor bugs.
    Fix: Tidy CMakeLists & vcxproj (#920).
    Fix: Added m2ts and -mxf to help screen.
    Fix: Added MKV to demuxer_print_cfg.
    Fix: Added MXF to demuxer_print_cfg.
    Fix: "Out of order packets" error had wrong print() parameters.
    Fix: Updated Python documentation.
    Fix: Fix incorrect path in XML (#904).
    Fix: linux build script (non-debug): Don't hide warnings from compiler.
    Fix: linux build script (debug): Display which step of the build script we're in.
    Fix: Make the build reproducible (#976).
    Fix: Remove instance of o1 and o2 from help.
    Fix: Colors of DVB subtitles with depth 2 broken due to a missing break.
    Fix: CEA-708: Caption loss due to CW command (#991).
    Fix: CEA-708: Update patch for windows priority with functions (#990).

    0.86 (2018-01-09)
    New: Preliminary MXF support
    New: Added a histogram in one-minute increments of the number of lines in a subtitle.
    New: Added Autoconf build scripts for CCExtractor to generate makefiles (mac).
    New: Added Autoconf build scripts for CCExtractor to generate makefiles (linux).
    New: Added .rpm package generation script.
    New: Added build/installation script for .pkg.tar.xz (Arch Linux).
    New: Added tarball generation script.
    New: Added --analyzevideo. If present the video stream will be processed even if the
    subtitles are in a different stream. This is useful when we want video information
    (resolution, frame type, etc). -vides now implies this option too.
    [Note: Tentative - some possibly breaking changes were made for this, so if you
    use it, validate the results]
    New: Added a GUI in the main CCExtractor binary (separate from the external GUIs
    such as CCExtractorGUI).
    New: A Python binding extension so it's possible to use CCExtractor's tools from
    New: Added -nospupngocr (don't OCR bitmaps when generating spupng, faster)
    New: Add support for file split on keyframe (-segmentonkeyonly)
    New: Added WebVTT output from Matroska.
    New: Support for source-specific multicast.
    New: FreeType-based text renderer (-out=spupng with teletext/EIA608).
    New: Upgrade library UTF8proc
    New: Upgrade library win_iconv
    New: Upgrade library zlib
    New: Upgrade library LibPNG
    New: Support for Source-Specific Multicast
    New: Added Travis CI support
    New: Made error messages clearer, less ambiguous
    Fix: Prevent the OCR being initialized more than once (happened on multiprogram and
    PAT changes)
    Fix: Makefiles, build scripts, etc... everything updated and corrected for all
    Fix: Proper line ending for .srt files from bitmaps.
    Fix: OCR corrections using grayscale before extracting texts.
    Fix: End timestamps in transcripts from DVB.
    Fix: Forcing -noru to cause deduplication in ISDB
    Fix: TS: Skip NULL packets
    Fix: When NAL decoding fails, don't dump the whole decoded thing, limit to 160 bytes.
    Fix: Modify Autoconf scripts to generate tarball for mac from /package_creators/
    and include GUI files in tarball
    Fix: Started work on libGPAC upgrade.
    Fix: DVB subtitle not extracted if there's no display segment
    Fix: Heap corruption in add_ocrtext2str
    Fix: Bug that sometimes caused -out=spupng to crash
    Fix: Checks for text before newlines on DVB subtitles
    Fix: OCR issue caused by separated dvb subtitle regions
    Fix: DVB crash on specific condition (!rect->ocr_text)
    Fix: DVB bug (Multiple-line subtitle; Missing last line)
    Fix: --sentencecap for teletext samples
    Fix: Crash when image passed into OCR is empty
    Fix: Temporarily wrapped the Python API, not production ready yet
    Fix: -delay option in DVB
Version 0.85b Version 0.85
  • Release Date: Jan 25, 2017
  • Download(s):
  • 0.85 (2017-01-24)
    - New: Added FFMPEG 3.0 to Windows build - last one that is XP compatible.
    - New: Major improvements in CEA-608 to WebVTT (styles, etc).
    - New: Return a non-zero return code if no subtitles are found.
    - New: Windows build files updated to Visual Studio 2015, new target platform is 140_xp.
    - New: Added basic support of Tesseract 4.0.0.
    - New: Added build script for .deb.
    - New: Updated -debugdvbsub parameter to get the most relevant DVB traces for debugging.
    - New: SMPTE-TT files are now compatible with Adobe Premiere.
    - New: Updated libpng.
    - New: Added 3rd party (Tracy from static linux build script.
    - New: Add chapter extraction for MP4 files.
    - New: Return code 10 if no captions are found at all.
    - Fix: Teletext duplicate lines in certain cases.
    - Fix: Improved teletext timing.
    - Fix: DVB timing is finally good.
    - Fix: A few minor memory leaks.
    - Fix: tesseract library file included in mac build command.
    - Fix: Bad WTV timings in some cases.
    - Fix: Mac build script.
    - Fix: Memory optimization in HARDSUBX edit_distance.
    - Fix: SubStation Alpha subtitles in bitmap.
    - Fix: lept msg severity in linux.
    - Fix: SSA, SPUPNG and VTT timing and skipping of subtitles for SAMI and TTML.
    - Fix: SMPTE-TT : Added support for font color.
    - Fix: SAMI unnecessary empty subtitle when extracting DVB subs.
    - Fix: Skip the packet if the adaptation field length is broken.
    - Fix: 708 - lots of work done in the decoder. Implementation of more commands. Better timing.
    - Fix: Signal handlers.
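
The 0.85 entries above mention a non-zero return code when no subtitles are found, and return code 10 when no captions are found at all. A calling script could branch on that value roughly as sketched below. This is only an illustration: the helper names are hypothetical, treating 0 as success is the usual process convention rather than something the changelog states, and the wrapper simply passes the input file as the only argument.

```python
import subprocess

# Value taken from the 0.85 changelog entry ("Return code 10 if no captions
# are found at all"). Everything else here is an illustrative assumption.
NO_CAPTIONS_EXIT_CODE = 10


def classify_exit_code(code: int) -> str:
    """Map a process exit code to a coarse outcome label (hypothetical helper)."""
    if code == 0:
        return "ok"  # assumed: conventional success code
    if code == NO_CAPTIONS_EXIT_CODE:
        return "no_captions"
    return "error"  # any other non-zero code is treated as a failure


def run_ccextractor(input_path: str, binary: str = "ccextractor") -> str:
    """Run the extractor on one file and classify the result (hypothetical wrapper)."""
    result = subprocess.run([binary, input_path])
    return classify_exit_code(result.returncode)
```

Keeping the classification separate from the process call makes the branching logic easy to check without the binary installed.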
Version 0.84
  • Release Date: Dec 17, 2016
  • Download(s):
  • 0.84 (2016-12-16)
    - New: In Windows, both the OCR and non-OCR binaries are bundled, since the OCR one causes problems due to
    dependencies on some systems. So unless you need the OCR, just use the non-OCR version.
    - New: Added -sbs (sentence by sentence) for DVB output. Each frame in the output file contains a complete
    sentence (experimental).
    - New: Added -curlposturl. If used each output frame will be sent with libcurl by doing a POST to that URL.
    - Fix: More code consistency checking in function names.
    - Fix: linux build script now tries to verify dependencies.
    - Fix: Mac build script was missing a directory.
Version 0.83
  • Release Date: Dec 15, 2016
  • Download(s):
  • - Fix: Duplicate lines in mp4 (specifically affects itunes).
    - Fix: Timing in .mp4, timing now calculated for each CC pair instead of per atom.
    - Fix: Typos everywhere in the documentation and source code.
    - Fix: CMakeLists for build in cmake.
    - Fix: -unixts option.
    - Fix: FPS switching messages.
    - Fix: Removed ugly debug statement with local path in HardsubX.
    - Fix: Changed platform target to v120_xp in Visual Studio (so XP is supported again).
    - Fix: Added detail in many error messages.
    - Fix: Memory leaks in videos with XDS.
    - Fix: Makefile compatibility issues with Raspberry pi.
    - Fix: missing separation between WebVTT header and body. 
    - Fix: Stupid bug in M2TS that prevented it from working.
    - Fix: OCR libraries dependencies for the release version in Windows.
    - Fix: non-buffered reading from pipes.
    - Fix: --stream option with stdin.
    - New: terminate_asap to buffered_read_opt
    - New: Added some TV-show specific spelling dictionaries.
    - New: Updated GPAC library.
    - New: ASS/SSA.
    - New: Capture sigterm to do some clean up before terminating.
    - New: Work on 708: Changed DefineWindow behavior, only clear text of an existing window if style has changed.
Version 0.82
  • Release Date: Aug 16, 2016
  • Download(s):
  • - New: HardsubX - Burned in subtitle extraction subsystem.
    - New: Color Detection in DVB Subtitles
    - Fix: Corrected sentence capitalization
    - Fix: Skipping redundant bytes at the end of tx3g atom in MP4
    - Fix: Illegal SRT files being created from DVB subtitles
    - Fix: Incorrect Progress Display
Version 0.81
  • Release Date: Jun 22, 2016
  • Download(s):
  • - New: --version parameter for extensive version information (version number, compile date, executable hash, git commit (if appropriate))
    - New: Add -sem (semaphore) to create a .sem file when an output file is open and delete it when it's closed.
    - New: Add --append parameter. This will prevent overwriting of existing files.
    - New: File Rotation support added. The user has to send a USR1 signal to rotate.
    - Fix: Issues with files <1 Mb
    - Fix: Preview of generated transcript.
    - Fix: Statistics were not generated anymore.
    - Fix: Correcting display of sub mode and info in transcripts.
    - Fix: Teletext page number displayed in -UCLA.
    - Fix: Removal of excessive XDS notices about aspect ratio info.
    - Fix: Force Flushing of file buffers works for all files now.
    - Fix: mp4 void atoms that were causing some .mp4 files to fail.
    - Fix: Memory usage caused by EPG processing was high due to many non-dynamic buffers.
    - Fix: Project files for Visual Studio now include OCR support in Windows.
Version 0.80
  • Release Date: Apr 26, 2016
  • Download(s):
  • - Fix: "Premature end of file" (one of the scenarios)
    - Fix: XDS data is always parsed again (needed to extract information such as program name)
    - Fix: Teletext parsing: @ was incorrectly exported as * - X/26 packet specifications in ETS 300 706 v1.2.1 now better followed
    - Fix: Teletext parsing: Latin G2 subsets and accented characters not displaying properly
    - Fix: Timing in -ucla
    - Fix: Timing in ISDB (some instances)
    - Fix: "mfra" mp4 box weight changed to 1 (this helps with correct file format detection)
    - Fix: Fix for TARGET File is null. 
    - Fix: Fixed SegFaults while parsing parameters (if mandatory parameter is not present in -outinterval, -codec or -nocodec)
    - Fix: Crash when input is too small
    - Fix: Update some URLs in code (references to docs)
    - Fix: -delay now updates final timestamp in ISDB, too
    - Fix: Removed minor compiler warnings
    - Fix: Visual Studio solution files working again
    - Fix: ffmpeg integration working again
    - New: Added --forceflush (-ff). If used, output file descriptors will be flushed immediately after being written to
    - New: Hexdump XDS packets that we cannot parse (shouldn't be many of those anyway)
    - New: If input file cannot be opened, provide a decent human readable explanation
    - New: GXF support
Version 0.79
  • Release Date: Jan 14, 2016
  • Download(s):
  • - Support for Grid Format (g608)
    - Show correct number of teletext packets processed
    - Removed Segfault on incorrect mp4 detection
    - Remove xml header from transcript format
    - Help message updated for Teletext
    - Added --help and -h for help message
    - Added --nohtmlescape option
    - Added --noscte20 option
Version 0.78
  • Release Date: Dec 13, 2015
  • Download(s):
  • - Support to extract Closed Caption from MultiProgram at once.
    - CEA-708: exporting to SAMI (.smi), Transcript (.txt), Timed Transcript (ttxt) and SubRip (.srt).
    - CEA-708: 16 bit charset support (tested on Korean).
    - CEA-708: Roll Up captions handling.
    - Changed TCP connection protocol (BIN data is now wrapped in packets, added EPG support and keep-alive packets).
    - TCP connection password prompt is removed. To set connection password use -tcppassword argument instead.
    - Support ISDB Closed Caption.
    - Added a new output format, simplexml (used internally by a CCExtractor user, may or may not be useful for
      anyone else).

Version 0.77
  • Release Date: Jun 21, 2015
  • Download(s):
  • - Fixed bug in capitalization code ('I' was not being capitalized).
    - GUI should now run in Windows 8 (using the included .Net runtime, since 3.5 cannot be installed in Windows 8 apparently).
    - Fixed Mac build script, binary is now compiled with support for files over 2 GB.
    - Fixed bug in PMT code, damaged PMT sections could make CCExtractor crash.

Version 0.76
  • Release Date: Mar 29, 2015
  • - Added basic M2TS support
    - Added EPG support - you can now export the Program Guide to XML
    - Some bugfixes
Version 0.75
  • Release Date: Jan 16, 2015
  • Download(s):
  • - Fixed issue with teletext output to formats other than srt.
    - CCExtractor can be used as library if compiled using cmake
    - By default the Windows version adds BOM to generated UTF files (this is because it's needed to open the files correctly) while all other builds don't add it (because it messes with text processing tools). 
      You can use -bom and -nobom to change the behaviour.
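
For readers unsure what the BOM note in 0.75 means in practice: the UTF-8 byte order mark is the three bytes EF BB BF at the start of a file. The sketch below is not CCExtractor code; it shows, with hypothetical helper names, how a downstream tool might add, detect or strip that prefix using Python's standard `codecs` module.

```python
import codecs


def encode_utf8(text: str, add_bom: bool) -> bytes:
    """Encode text as UTF-8, optionally prefixed with the byte order mark."""
    data = text.encode("utf-8")
    return codecs.BOM_UTF8 + data if add_bom else data


def has_bom(data: bytes) -> bool:
    """Return True if the byte string starts with the UTF-8 BOM (EF BB BF)."""
    return data.startswith(codecs.BOM_UTF8)


def strip_bom(data: bytes) -> bytes:
    """Drop a leading UTF-8 BOM if present, for tools that cannot handle it."""
    return data[len(codecs.BOM_UTF8):] if has_bom(data) else data
```

A text-processing pipeline that receives files from a Windows build could call `strip_bom` on the raw bytes before decoding.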
Version 0.74
  • Release Date: Sep 24, 2014
  • Download(s):
  • - Fixed issue with -o1 -o2 and -12 parameters (where it would write output only in the o2 file)
    - Fixed UCLA parameter issue. Now the UCLA parameter settings can't be overwritten anymore by later parameters that affect the custom transcript
    - Switched order around for TLT and TT page number in custom transcript to match UCLA settings
    - Added nobom parameter, for when files are processed by tools that can't handle the BOM. If using this, files might be not readable under windows.
    - Segfault fix when no input files were given
    - No more bin output when sending to server + possibility to send TT to server for processing
    - Windows: Added the Microsoft redistributable MSVCR120.DLL to both the installation package and the application zip.
Version 0.73
  • Release Date: Aug 19, 2014
  • Download(s):
  • - Added support of BIN format for Teletext
    - Added start of librarisation. This will allow in the future for other programs to use encoder/decoder functions and more.
Version 0.72
  • Release Date: Aug 13, 2014
  • Download(s):
  • - Fix for WTV files with incorrect timing
    - Added support for fps change using data from AVC video track in a H264 TS file.
Version 0.71
  • Release Date: Jul 29, 2014
  • Download(s):
  • - Added feature to receive captions in BIN format according to CCExtractor's own
      protocol over TCP (-tcp port [-tcppassword password])
    - Added ability to send captions to the server described above or to the
      online repository (-sendto host[:port])
    - Added -stdin parameter for reading input stream from standard input
    - Compilation in Cygwin using linux/Makefile
    - Fix for .bin files when not using latin1 charset
    - Correction of mp4 timing when one timestamp gives the timing of two atoms
Version 0.70
  • Release Date: Jul 7, 2014
  • Download(s):
  • This is the first release that is part of Google's Summer of Code.
    Anshul, Ruslan and Willem joined CCExtractor to work on a number of things
    over the summer, and their work is already reaching the mainstream 
    version of CCExtractor.

    - Added a huge dictionary submitted by Matt Stockard.
    - Added DVB subtitles decoder, spupng in output
    - Added support for cdt2 media atoms in QT video files. Now multiple atoms in
     a single sample sequence are supported.
    - Changed Makefile.
    - Fixed some bugs.
    - Added feature to print info about file's subtitles and streams (-out=report).
    - Support Long PMT.
    - Support Configuration file.
    - There is a sample configuration file in doc/ folder with name
    - For now, only a file named ccextractor.cnf kept beside the ccextractor
     executable is supported
    - for details of which options can be set using configuration file,
     please look at sample file.

    - Added options for custom transcript output:
    new parameter (-customtxt format), where the format must be like this: 1100100 (7 digits).
    These indicate whether the following items should be displayed or not in the (timed) transcript:
    - Display start time
    - Display end time
    - Display caption mode
    - Display caption channel
    - Use a relative timestamp (relative to the sample)
    - Display XDS info
    - Use colors
    0000101 is the default setting for transcripts
    1110101 is the default for timed transcripts
    1111001 is the default setting for -ucla
    Make sure you use this parameter after others that might affect these 
    settings (-out, -ucla, -xds, -txt, -ttxt, ...)
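
    The 7-digit -customtxt format described above can be decoded as in the sketch below. This is an illustration rather than CCExtractor's own parser: the field names are made up for readability, and it assumes the digits map to the listed items in the order given.

```python
# Hypothetical field names, in the order the changelog lists the seven items.
CUSTOMTXT_FIELDS = (
    "start_time",
    "end_time",
    "caption_mode",
    "caption_channel",
    "relative_timestamp",
    "xds_info",
    "colors",
)


def parse_customtxt(fmt: str) -> dict:
    """Turn a 7-digit string of 0s and 1s into a {field: bool} mapping."""
    if len(fmt) != len(CUSTOMTXT_FIELDS) or set(fmt) - {"0", "1"}:
        raise ValueError("format must be exactly 7 digits, each 0 or 1")
    return {name: digit == "1" for name, digit in zip(CUSTOMTXT_FIELDS, fmt)}
```

    Under that assumption, the timed-transcript default 1110101 enables start time, end time, caption mode, relative timestamps and colors, and leaves caption channel and XDS info off.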

Version 0.69
  • Release Date: Apr 5, 2014
  • Download(s):
  • A few patches from Christopher Small, including proper support for multiple multicast clients listening on the same port.
  • GUI: Fixed teletext preview.
  • GUI: Added a small indicator of data being received when reading from UDP.
  • GUI: Added UTF-8 support to preview Window (used for teletext).
  • Fixes in Makefile and build script, compilation in linux and OSX failed if another libpng was found in the system.
  • WTV support directly in CCExtractor (no need for wtvccdump any more).
  • Started refactoring and clean-up.
  • Fix: MPEG clock rollover (happens every 26 hours) caused a time discontinuity.
  • Windows GUI: Started work on HDHomeRun support. For now it just looks for HDHomeRun devices. Lots of other things will arrive in the next versions.
  • Windows GUI: Some code refactoring, since the HDHomeRun support makes the code large enough to require more than one source file :-)
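
The 26-hour figure in the clock rollover fix above follows from MPEG transport stream timestamps being 33-bit counters that tick at 90 kHz: 2^33 / 90000 is about 95,444 seconds, or roughly 26.5 hours. The sketch below shows that arithmetic and one common way to turn wrapped timestamps into a continuous timeline. It is a generic technique with hypothetical names, not the code CCExtractor uses.

```python
PTS_BITS = 33            # width of an MPEG presentation timestamp
PTS_CLOCK_HZ = 90_000    # timestamp tick rate
PTS_WRAP = 1 << PTS_BITS


def rollover_hours() -> float:
    """Hours until a 33-bit, 90 kHz counter wraps back to zero."""
    return PTS_WRAP / PTS_CLOCK_HZ / 3600


class PtsUnwrapper:
    """Convert wrapping timestamps into monotonically extended ones."""

    def __init__(self) -> None:
        self._last = None   # last raw timestamp seen
        self._offset = 0    # accumulated multiples of PTS_WRAP

    def unwrap(self, pts: int) -> int:
        # A backwards jump of more than half the counter range is treated as
        # a rollover; smaller backwards steps (e.g. frame reordering) are not.
        if self._last is not None and self._last - pts > PTS_WRAP // 2:
            self._offset += PTS_WRAP
        self._last = pts
        return pts + self._offset
```

The half-range threshold is a design choice: it tolerates small out-of-order timestamps while still catching the jump from near 2^33 back to near zero.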
