bigbluebutton-Github/record-and-playback/core
Calvin Walton 1866eb7194 Perform captions generation in UTF-16 encoding
The indexes returned in recording events from BBB refer to positions
within a UTF-16 encoded string. Rather than attempt to untangle this in
the server (which might have a performance cost), it's easier to switch
the caption processing code to operate in UTF-16 encoding as well to
make it work consistently.
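
To illustrate the mismatch described above, here is a minimal sketch that uses only the Python standard library rather than PyICU. The helper names `utf16_len` and `utf16_slice` and the sample caption are hypothetical and are not taken from the BBB recording scripts. They only show why an offset counted in UTF-16 code units lands in the wrong place when applied to a native Python `str`, which is indexed by code point.

```python
def utf16_len(text: str) -> int:
    """Length of `text` counted in UTF-16 code units (hypothetical helper)."""
    # utf-16-le emits exactly 2 bytes per code unit and no byte order mark.
    return len(text.encode("utf-16-le")) // 2


def utf16_slice(text: str, start: int, end: int) -> str:
    """Slice `text` using offsets counted in UTF-16 code units (hypothetical helper)."""
    data = text.encode("utf-16-le")
    # A slice that splits a surrogate pair decodes to U+FFFD instead of raising.
    return data[2 * start:2 * end].decode("utf-16-le", errors="replace")


# The emoji is one code point in a Python str but two UTF-16 code units.
caption = "Hi \U0001F600 there"

print(len(caption))        # length in code points
print(utf16_len(caption))  # length in UTF-16 code units

# Offsets 6..11 address "there" when counted in UTF-16 code units,
# but the same numbers applied to the Python str are off by one.
print(utf16_slice(caption, 6, 11))
print(caption[6:11])
```

Re-encoding on every slice costs time proportional to the string length, so this is only a demonstration of the offset problem, not a suggested replacement for a UTF-16 string type.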

The PyICU library provides a UnicodeString type, which is a UTF-16 string
similar to those in Java and JavaScript, but which supports all the Python
indexing methods. It's fairly straightforward to swap it in place of
the types used previously, and it works natively as an input to the ICU
line break iterator too.

Fixes #10531
2020-09-25 12:39:18 -04:00
features Remove a few matterhorn references that were missed. 2016-09-02 17:45:10 -04:00
lib Fix io deadlock in recording scripts process execution utilities 2020-09-23 16:16:57 -04:00
resources/raw - extract metadata from events.xml and generate index.html in publish dir for simple playback 2011-05-24 18:16:54 -04:00
scripts Perform captions generation in UTF-16 encoding 2020-09-25 12:39:18 -04:00
spec Include webrtc webcams in recording processing 2018-04-24 15:20:35 -04:00
systemd - run keep-events worker separate from archive-worker 2019-07-04 11:00:59 -07:00
Gemfile RaP: Add an optional test group which installs e.g. rubocop 2019-06-04 11:03:22 -04:00
Gemfile.lock Bump dependency for nokogiri version 2020-02-29 08:56:33 -06:00