File(s) under permanent embargo
Incorporating domain knowledge with video and voice data analysis in news broadcasts
conference contribution
posted on 2000-01-01, 00:00 authored by K Shearer, C Dorai, Svetha VenkateshSvetha VenkateshThis paper addresses the area of video annotation, indexing and retrieval, and shows how a set of tools can be employed, along with domain knowledge, to detect narrative structure in broadcast news. The initial structure is detected using low-level audio visual processing in conjunction with domain knowledge. Higher level processing may then utilize the initial structure detected to direct processing to improve and extend the initial classification.
The structure detected breaks a news broadcast into segments, each of which contains a single topic of discussion. Further the segments are labeled as a) anchor person or reporter, b) footage with a voice over or c) sound bite. This labeling may be used to provide a summary, for example by presenting a thumbnail for each reporter present in a section of the video. The inclusion of domain knowledge in computation allows more directed application of high level processing, giving much greater efficiency of effort expended. This allows valid deductions to be made about structure and semantics of the contents of a news video stream, as demonstrated by our experiments on CNN news broadcasts.
The structure detected breaks a news broadcast into segments, each of which contains a single topic of discussion. Further the segments are labeled as a) anchor person or reporter, b) footage with a voice over or c) sound bite. This labeling may be used to provide a summary, for example by presenting a thumbnail for each reporter present in a section of the video. The inclusion of domain knowledge in computation allows more directed application of high level processing, giving much greater efficiency of effort expended. This allows valid deductions to be made about structure and semantics of the contents of a news video stream, as demonstrated by our experiments on CNN news broadcasts.
History
Event
Workshop on Multimedia Data Mining (1st : 2000 : Boston, Mass.)Pagination
46 - 53Publisher
[Association for Computing Machinery]Location
Boston, Mass.Place of publication
[New York, N. Y.]Start date
2000-08-20Language
engPublication classification
E1.1 Full written paper - refereedCopyright notice
2000, The AuthorsEditor/Contributor(s)
S Simoff, O ZaïaneTitle of proceedings
MDM/KDD 2000 : Proceedings of the 1st Workshop on Multimedia Data MiningUsage metrics
Categories
No categories selectedLicence
Exports
RefWorks
BibTeX
Ref. manager
Endnote
DataCite
NLM
DC