The purpose of this document is to provide descriptions of six core experiments (CEs) on MPEG Compact Descriptors for Video Analysis (CDVA) (see N15339, "Call for Proposals for Compact Descriptors for Video Analysis (CDVA) - Search and Retrieval", June 2015, Warsaw, PL). The previous three CEs have been reviewed. Based on the responses received and the open issues they raise, one additional CE on matching and retrieval has been added, and the CEs related to deep-learning-based descriptors have been split. The descriptions of all CEs have been improved, with more details on the experiments and the expected results.
The results of the experiments will be discussed on the relevant email reflector before the 118th MPEG meeting.
The report of each CE should include: (1) a comparison between the tested solutions, and (2) a recommendation to the AhG based on the results of the CE.
Results will be reported according to the evaluation criteria described below, in particular those of the evaluation framework defined in N15729, "Evaluation Framework for Compact Descriptors for Video Analysis - Search and Retrieval – Version 2.0", October 2015, Geneva, CH. Results based on other evaluation criteria may also be included.
If learning-based methods are used, the CDVA data set must not be used for training. The data sets used for training shall be reported. The details of the network configuration used shall be included in the CE response (or given as a reference to an input document that contains these details).
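A proponent may wish to check the training-data restriction above before submitting a CE response. The following is a minimal sketch in Python, assuming that both the training data and the CDVA data set are available as lists of file identifiers. The function name and all identifiers shown are hypothetical and are not part of the CDVA evaluation framework. The check compares names only, so it would not detect renamed or re-encoded copies of CDVA material.

```python
from typing import Iterable, Set


def find_training_overlap(training_ids: Iterable[str],
                          cdva_ids: Iterable[str]) -> Set[str]:
    """Return the identifiers that occur in both lists.

    Identifiers are compared after trimming whitespace and lower-casing,
    so trivial differences in spelling do not hide an overlap.
    """
    def normalize(identifier: str) -> str:
        return identifier.strip().lower()

    training = {normalize(i) for i in training_ids}
    cdva = {normalize(i) for i in cdva_ids}
    return training & cdva


# Hypothetical file lists, for illustration only.
training_files = ["external_set_a/clip_0001.mp4", "external_set_b/img_0007.jpg"]
cdva_files = ["cdva/query_0001.mp4", "cdva/reference_0042.mp4"]

overlap = find_training_overlap(training_files, cdva_files)
if overlap:
    print("CDVA items found in the training data:", sorted(overlap))
else:
    print("No CDVA items found in the training data.")
```

An empty result only indicates that no identifier is shared between the two lists; the list of training data sets still has to be reported in the CE response.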