dlib/examples/ffmpeg_video_demuxing2_ex.cpp


// The contents of this file are in the public domain. See LICENSE_FOR_EXAMPLE_PROGRAMS.txt
/*
This is an example illustrating the use of the ffmpeg wrappers,
in this case the demuxer API.
In this example, we show how to read both images and audio.
*/
#include <cstdio>
#include <cstdlib>
#include <string>
#include <dlib/media.h>
#include <dlib/gui_widgets.h>
#include <dlib/cmd_line_parser.h>
using namespace std;
using namespace dlib;
using namespace dlib::ffmpeg;
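
// Illustrative sketch only: audio_duration_seconds() is a hypothetical helper,
// not part of dlib. It converts a sample count, such as the total printed at the
// end of this example, into a duration in seconds. It assumes the count is per
// channel and that the caller supplies the stream's sample rate in Hz.
inline double audio_duration_seconds(const long long nsamples, const int sample_rate_hz)
{
    // Without a positive sample rate and sample count there is no duration to report.
    if (sample_rate_hz <= 0 || nsamples <= 0)
        return 0.0;
    return static_cast<double>(nsamples) / sample_rate_hz;
}
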
int main(const int argc, const char** argv)
try
{
    command_line_parser parser;
    parser.add_option("i", "input video", 1);
    parser.add_option("verbose", "enable all internal ffmpeg logging");

    parser.set_group_name("Help Options");
    parser.add_option("h", "alias of --help");
    parser.add_option("help", "display this message and exit");

    parser.parse(argc, argv);
    const char* one_time_opts[] = {"i"};
    parser.check_one_time_options(one_time_opts);

    if (parser.option("h") || parser.option("help"))
    {
        parser.print_options();
        return EXIT_SUCCESS;
    }

    // The input file is mandatory. Calling argument() on an option that was
    // not supplied is an error, so check for it first.
    if (!parser.option("i"))
    {
        printf("No input file given, use -i <file>\n");
        parser.print_options();
        return EXIT_FAILURE;
    }

    const std::string filepath = parser.option("i").argument();
    // We use the simplest constructor, the one that takes only a file path.
    // By default, both images and audio are read.
    demuxer cap(filepath);

    if (!cap.is_open())
    {
        printf("%s is not a valid video file\n", filepath.c_str());
        return EXIT_FAILURE;
    }
    // Read images and audio.
    // Images are displayed in a window, audio samples are counted.
    frame               f;
    array2d<rgb_pixel>  img;
    image_window        win;
    int                 nsamples{0};

    // By default, read() returns frames exactly as the codec produced them.
    // To resize images or change their pixel format, or to resample audio or
    // change its sample format, pass extra arguments to read(). Since we want
    // rgb_pixel, we set the pixel format accordingly. The two zeros keep the
    // default image dimensions, so the image is not resized.
    const resizing_args args_image {0, 0, pix_traits<rgb_pixel>::fmt};
    while (cap.read(f, args_image))
    {
        if (f.is_image())
        {
            convert(f, img);
            win.set_image(img);
        }
        else if (f.is_audio())
        {
            nsamples += f.nsamples();
        }
    }

    printf("Read %i audio samples\n", nsamples);

    return EXIT_SUCCESS;
}
catch (const std::exception& e)
{
    printf("%s\n", e.what());
    return EXIT_FAILURE;
}