LIVE Public Results
This is a complete list of all the LIVE public results currently available for download. The entries are in chronological order. The list includes scientific papers, reports, publications, press releases, and presentations. For alerts on future entries use the RSS feed feature in the page header.
- Recommender System for the Multi-Channel TV Production
- This paper presents the concept of content recommendations for the production of multi-channel TV shows. Within the IST FP6 project "LIVE – Live Staging of Media Events" we are developing a production support system that will offer content recommendations and support the production of multi-channel programs. The paper outlines a concept of a recommender system for multi-channel TV production and presents the basic architecture and workflows within the system. The recommendation of archive content for a given channel is personalized by taking into account the profile of the target audience.
- LIVE - a system for consumer-personalised production of TV programmes
- This paper presents the concept of viewer feedback in the production of multi-channel TV shows. Within the IST project LIVE (Live Staging of Media Events) we are developing a production support system that will offer content recommendations and support the production of multi-channel programs. The paper outlines a concept of a recommender and feedback system for multi-channel TV production and presents examples of the feedback tools within the TV production office. The LIVE system enables the Director to track the preferences of the TV viewers in real time, during the live production of the show, and gives the viewers the possibility to actively influence the TV content.
- Ein Ansatz zur Unterstützung traditioneller Klassifikation durch Social Tagging (An Approach to Supporting Traditional Classification through Social Tagging)
- This paper presents an approach for combining traditional, closed classification methods with open classification methods based on social tagging. The discussion starts from the basic requirements for search and navigation in document archives, examines the advantages and disadvantages of closed and open classification approaches, and finally presents a combined solution that was implemented in a prototype. In this solution, documents can in principle be classified with free tags; the classification is, however, supported by a controlled vocabulary. Free tags are adopted into the controlled vocabulary in a subsequent, moderated process. The vocabulary, which grows and is continuously maintained in this way, supports search and navigation in the document space.
- Bringing “Intelligence” to iTV: The Intelligent Media Framework
- This paper gives an overview of a software framework designed for the creation of interactive multi-channel television shows. The “Intelligent Media Framework” forms the middleware of an iTV production support system developed in the context of the European integrated project “LIVE”. The framework is designed according to service oriented architecture (SOA) principles for easy integration into existing iTV and TV production environments. Moreover, the Intelligent Media Framework is based on a knowledge model formalising the main aspects identified to make up the domain of real-time staging of media events: the content (media clips and media streams), the events, the staging and the users (professional users and consumers). The framework offers services for the development of tools assisting the production team of multi-channel iTV shows in an intelligent way. The envisaged “intelligence” is based on formal, machine-understandable descriptions of the content and the events. This document introduces the knowledge model and provides an overview of the architecture of a media framework designed to support the iTV production process in an intelligent way.
- Face Detection based Neural Networks using Robust Skin Color Segmentation
- This paper proposes a robust scheme for a face detection system that uses a Gaussian mixture model to segment images based on skin color. After skin and non-skin face candidates are selected, features are extracted directly from discrete cosine transform (DCT) coefficients computed from these candidates. Back-propagation neural networks are then used to train on and classify faces based on DCT feature coefficients in the Cb and Cr color spaces. This scheme utilizes skin color information, which is the main feature for face detection. DCT feature values of faces, representing the data set of skin/non-skin face candidates obtained from the Gaussian mixture model, are fed into the back-propagation neural networks to classify whether the original image includes a face or not. Experimental results show that the proposed scheme is reliable for face detection, and that pattern features are detected and classified accurately by the back-propagation neural networks.
- Bringing 'Intelligence' to iTV: The Intelligent Media Framework - Poster
- Poster presentation of the role of the Intelligent Media Framework in the LIVE project. It also provides an overview of the technologies and services of the Intelligent Media Framework.
- LIVE Newsletter Issue 3
- Welcome to the third LIVE Newsletter. The LIVE production system will be tested at ORF (Austrian Broadcasting Corporation) during the Beijing Olympic Games. A total of 500 Austrian households will view and interact with the "LIVE Olympic Show". Over the two-week period a total of four interlinked channels will be produced. If successful, LIVE could permanently change the way we view live events such as the Olympics, the FIFA World Cup or a political election. Beyond the clear advantage of fuller coverage of the event itself, those irritating moments of not knowing the details of a sporting event (e.g. details about the contestants, the history behind it, or information on the venue) will be conveniently dispensed with by the...
- LIVE Olympic Trial Press Release Brochure
- The LIVE production system will be tested at ORF (Austrian Broadcasting Corporation) during the Beijing Olympic Games. A total of 500 Austrian households will view and interact with the "LIVE Olympic Show". Over the two-week period a total of four interlinked channels will be produced. If successful, LIVE could permanently change the way we view live events such as the Olympics, the FIFA World Cup or a political election. Beyond the clear advantage of fuller coverage of the event itself, those irritating moments of not knowing the details of a sporting event (e.g. details about the contestants, the history behind it, or information on the venue) will be conveniently dispensed with by the power of this latest and pioneering broadcasting information technology. Perhaps even more important: for the first time it will be possible to serve the diverse moods of viewers by simultaneously offering multiple points of view on one and the same live event. As in real life, there is always more than one story to be told.
- A Block-Edge-Pattern-Based Content Descriptor in DCT Domain
- In this correspondence, we describe a robust and effective content descriptor based on block-edge patterns extracted in the discrete cosine transform (DCT) domain, which is suitable for applications involving JPEG or MPEG compressed images and videos. This content descriptor is constructed from a run-length edge-block histogram with three patterns: horizontal edge, vertical edge and no edge. In comparison with existing descriptors, the proposed descriptor features: 1) low computational cost, suitable for real-time implementation and high-speed processing of compressed videos; 2) robustness to changes such as rotation, noise and reversal; 3) operation in the compressed domain. Extensive experiments support that the proposed content descriptor is effective in describing visual content, and achieves superior performance in terms of retrieval precision and recall rates.
- D5.2 Report On Live Human Annotation
- This document reports on human annotation within the LIVE project. First it gives an overview of the different annotation types that are useful for the LIVE staging of media events. It then summarizes the requirements for manual annotation gathered from potential users, e.g. in discussions with broadcasters, reporters, editors and video jockeys (VJs). It defines the content metadata needed within the LIVE system, gives an overview of existing tools and describes the tools developed for the LIVE project. Finally, it describes user evaluations of the developed tools that were performed with professional users from the ORF.
- An efficient face image retrieval through DCT features
- This paper proposes a new, simple method of DCT feature extraction that speeds up image retrieval and reduces the storage it requires by accessing and extracting content directly in the JPEG compressed domain. Our method extracts the average of some DCT block coefficients and needs only a vector of six coefficients per block over all image blocks. In our retrieval system, for simplicity, both query and database images are normalized and resized from the original database based on the centered position of the eyes, and the normalized image is divided equally into non-overlapping 8×8 pixel blocks. Each block is therefore associated with a feature vector derived directly from the discrete cosine transform (DCT). Users can select any query as the main theme of the query image. Retrieved images are ranked by their relevance to the query image, where similarity is measured by the Euclidean distance. The experimental results show that our approach readily identifies main objects and reduces the influence of the background in the image, and thus improves the performance of image retrieval.
- Subsampling-based image watermarking in compressed DCT domain
- In this paper, a new embedding strategy for watermarking is presented based on DC components of subimages in the compressed discrete cosine transform (DCT) domain. These subimages are obtained by subsampling the host image. Greater robustness is achieved when watermarks are embedded in perceptually significant DC components. Furthermore, the original image is not required in the extraction process. Experimental results show that the proposed scheme successfully makes the watermark perceptually invisible and robust against a wide range of attacks, including lossy JPEG compression, filtering, scaling, and cropping.
- Knowledge Acquisition from Multimedia Content
- Proceedings of the First International Workshop, KAMC 2007 Genova, Italy, December 5, 2007. In recent years significant advances have been made in the area of automatic extraction of low-level features from audiovisual content. However, little progress has been achieved in the identification of high-level semantic features or the effective combination of semantic features derived from different modalities. Knowledge acquisition is becoming a key-enabling factor of the above tasks towards more scalable and reliable solutions, and thus its automation is becoming critical. As the deployment of knowledge enhances the robustness of extraction while on the other hand the continuous extraction of semantic information can enrich this knowledge, synergistic approaches that combine multimedia extraction and knowledge evolution in a bootstrapping common framework introduce new opportunities in semantic multimedia applications. Integration with additional sources of information, e.g. by using human annotation tools or real-time event services, may further simplify and disambiguate semantic multimedia information systems. Moreover, adaptation to a particular domain, for example to sports events, such as the Olympic games, is essential in order to reduce the complexity of multimedia analysis. In this context, unified modelling and representation of multimedia and domain-specific knowledge, ontology evolution, and standard and non-standard inference services for multimodal semantic knowledge fusion, form cutting edge technologies. The aim of this workshop is to intensify the exchange of ideas between the different research communities involved which range from multimedia analysis to reasoning with ontologies.
The submitted contributions published in these proceedings therefore reflect current research in this area: the topics range from multimedia classification based on textual information, content-based shot classification and feature extraction to image classification based on ontologies. The submitted papers cover different application domains, i.e. broadcast news or legal documents. We would like to thank all members of the program committee for supporting us in the reviewing process, and the organizers of the main conference SAMT 2007, with which this workshop was co-located (especially Yannis Avrithis, Michela Spagnuolu and Francesco Robbiano), for their kind support throughout the organizational process. We would also like to thank the authors for their willingness to revise their initial submissions based on the reviewers' comments. Finally, we would like to thank our invited speakers, Fabio Ciravegna and Alan Smeaton, for their willingness to give a talk at our workshop.
- Real-time shot cut detection in compressed domain
- In this short paper, we propose a fast and simple shot cut detection algorithm, which operates directly in the compressed domain and is suitable for real-time implementation. The proposed algorithm exploits existing MPEG techniques by examining the prediction status of each macro-block inside B frames and P frames. As a result, both abrupt and dissolved shot cuts are located by a sequence of comparison tests, and thus no feature extraction or histogram differentiation is needed. Although the description of the algorithm is primarily based on MPEG-1 and MPEG-2 streams, the scheme can be readily extended to other video compression standards such as MPEG-4 and H.264 by following the principle of monitoring: (i) the balance between forward prediction and backward prediction; and (ii) the boundaries among P, B and I frames. Extensive experiments illustrate that the proposed algorithm outperforms similar existing algorithms, providing a useful technique for fast, on-line video content processing.
- A Block-Edge-Pattern based Content Descriptor in DCT Domain
- In this correspondence, we describe a robust and effective content descriptor based on block edge patterns extracted directly in the DCT domain, which is suitable for applications involving JPEG or MPEG compressed images and videos. This content descriptor is constructed from a run-length edge-block histogram with three patterns: horizontal edge, vertical edge and no edge. In comparison with existing descriptors, the proposed descriptor features: (i) low computational cost, suitable for real-time implementation and high-speed processing of compressed images or videos; (ii) robustness to changes such as rotation, noise and reversal; (iii) direct operation in the compressed domain. Extensive experiments support that the proposed content descriptor is effective in describing visual content. In comparison with existing techniques, the proposed descriptor achieves superior performance in terms of retrieval precision and recall rates.
- Specifications of Concepts and Professional User Interfaces for Live Staging with Consumer Feedback
- In this document we specify concepts for Live Staging and for the planning of Live Staging. In addition, where possible, procedures are defined that facilitate the development of a concrete Live Staging Concept by bringing the primarily artistic and intuitive approaches to the creation of live stories to a more formal level.
- LIVE Public Annual Report 2007
- This public annual report takes a look at some of the major activities and achievements of LIVE in 2007.
- Description of Online and Offline Metadata Extraction out of Sports Videos
- We focus on online and offline metadata extraction and annotation from sports videos. The main benefit of our method is immediate and automatic extraction and annotation of metadata by giving semantics to combinations of heterogeneous low-level visual features. It brings new opportunities for efficient utilisation of sports video in improved ways, and is easily customized to address the characteristics. Firstly, semantic scene classification is described, including key-frame extraction, determination of similarities between shots, and rule-based estimation of scene boundaries. Secondly, fuzzy-logic-based categorizing is presented, including the paradigm, the fuzzy membership function, fuzzy feature generation and the similarity measure. Thirdly, automatic sports video annotation is proposed, including robust dominant colour region detection and combined motion feature analysis. This work has been evaluated in the TRECVID 2007 competition.
- A New Robust Watermarking Scheme for Color Image in Spatial Domain
- This paper presents a new robust watermarking scheme for color images based on a block probability in the spatial domain. A binary watermark image is permuted using sequence numbers generated by a secret key and Gray code, and then embedded four times in different positions determined by a secret key. Each bit of the binary encoded watermark is embedded by modifying the intensities of a non-overlapping 8×8 block of the blue component of the host image. The watermark is extracted by comparing the intensities of an 8×8 block of the watermarked and the original images and calculating the probability of detecting '0' or '1'. Tested with the StirMark 4.0 benchmark, the proposed scheme is shown to be robust and secure against a wide range of image processing operations.
- Face Detection based on Skin Color in Image by Neural Networks
- Face detection is one of the challenging problems in image processing. A novel face detection system is presented in this paper. The approach relies on skin-based color features extracted with the two-dimensional Discrete Cosine Transform (DCT) and on neural networks, which detect faces by using skin color from the DCT coefficients of Cb and Cr feature vectors. The system uses skin color, which is the main feature of faces for detection; each skin face candidate is then examined by the neural networks, which learn from facial features to classify whether the original image includes a face or not. The processing is based on normalization and the Discrete Cosine Transform, followed by classification with the neural network approach. The experimental results on upright frontal color face images from the internet show an excellent detection rate.
- Camera Motion Analysis towards Semantic-based Video Retrieval in Compressed Domain
- To reduce the semantic gap between low-level visual features and the richness of human semantics, this paper proposes new algorithms that combine camera motion descriptors with multiple thresholds to automatically retrieve semantic concepts, i.e., close-up and panorama, directly in the MPEG compressed domain based on camera motion analysis. Extensive experiments illustrate that the proposed algorithms provide promising retrieval results in a real-time application scenario and without human intervention.
- Real-time and Automatic Close-up Retrieval from Compressed Videos
- In this paper, we propose a thorough scheme that uses a camera zooming descriptor with a two-level threshold to automatically retrieve close-ups directly from MPEG compressed videos based on camera motion analysis. In the retrieval process, we build camera-motion-based semantic retrieval. To improve the coverage of the proposed scheme, we investigate close-up retrieval in all kinds of videos. Extensive experiments illustrate that the proposed scheme provides promising retrieval results in a real-time, automatic application scenario.
- The LIVE System Architecture
- This poster was produced for the IBC event in September 2007. It provides an overview of the various components in the LIVE system.
- Tools for a complex iTV future
- LIVE poster produced for the IBC event in September 2007.
- There is always more than one story to be told
- LIVE poster produced for the IBC event in September 2007.
- Shift from the single to a multi-channel viewing experience
- LIVE poster produced for the IBC event in September 2007.
- Shaping Tomorrow's LIVE iTV Broadcast Experience
- Poster produced for the IBC Event in September 2007. Displays a user interaction scenario. The poster is only for promotional use and does not depict the actual LIVE user interface.
- Basic Specification of the Intelligent Media Framework (D7.4)
- The objective of this report is to provide a synopsis of the basic specification of the Intelligent Media Framework as developed during the first iteration cycle of the LIVE project (from January 2006 to June 2007). The implementation of this specification formed the middleware of the first prototype of the LIVE production support system. The Intelligent Media Framework combines a classical three-tier architecture with the principles of service oriented architectures (SOA). This report is addressed to the interested public. It presupposes some basic knowledge of software and knowledge engineering as well as some understanding of broadcasting issues. Topics covered in this report are: (1) an overview of the architecture of the LIVE production support system; (2) an overview of the LIVE staging domain and the requirements of different agents in the LIVE staging process; (3) the knowledge and framework requirements of the Intelligent Media Framework; (4) the knowledge model of the Intelligent Media Framework, comprising the knowledge structure, the term model, the event domain model and the basic IMA model; (5) the architecture of the Intelligent Media Framework (system design); (6) initial conclusions and an assessment of the requirements.
- LIVE Flyer
- The LIVE project flyer provides a summarised look at the main areas of work in the project. Available as a PDF for download, size 2.6 MB. Designed in a three panel layout.
- Journal Paper JVRB 2007 Vol.4: Video Composer and Live Video Conductor: Future Professions for the Interactive Digital Broadcasting Industry
- Innovations in hardware and network technologies lead to an exploding number of non-interrelated parallel media streams. Per se this does not mean any additional value for consumers. Broadcasting and advertisement industries have not yet found new formats to reach the individual user with their content. In this work we propose and describe a novel digital broadcasting framework, which allows for the live staging of (mass) media events and improved consumer personalisation. In addition, we describe new professions that will emerge in future TV production workflows, namely the 'video composer' and the 'live video conductor'. Online publication: http://www.jvrb.org/4.2007/1076/
- LIVE Newsletter 2
- This issue takes a closer look at the ideas and work behind the project. The point of departure is the LIVE technical view from which we will then explore in detail the individual LIVE system components. In short the LIVE approach represents a shift towards live broadcast shows that are dynamic and responsive to real-time events and changing needs of the audience.
- Short Project Description for EU Unit E2 brochure 2007
- Putting the viewer where the action is. The LIVE project attempts to radically improve on the linear approach to TV broadcasting of live sporting events to deliver digital technologies and content formats that enable viewers to shape their own personal and highly interactive viewing experience as they watch the broadcast. The main idea of LIVE is to provide novel content production methods and new iTV video formats and services to enable interactive digital broadcasters to produce new non-linear multi-stream ‘shows’ to stage live media events such as the 2008 Olympic Games.
- Control Room Poster
- Poster used at the LIVE review in March 2007 to explain the role of the control room in the LIVE system.
- Editor Room Poster
- Poster used at the LIVE review in March 2007 to explain the role of the editor room in the LIVE system.
- Archives Poster
- Poster used at the LIVE review meeting in March 2007 to explain the role of the archives in the LIVE system.
- Presentation on integration work in LIVE
- The following presentation is a public, shortened version of the presentation given at the review meeting in March 2007 at the ORF.
- LIVE: Bringing broadcasting to the next level / June Edition
- Broadcasting is changing. With the advent of everything from set-top boxes to IPTV and 3DTV, what can we expect the viewing experience of the future to be like? One IST project which is gathering steam is LIVE, which could prove to be TV’s most interactive ‘real-time’ experience.
- LIVE Overview Review Presentation 220307
- The central idea of LIVE is to create: novel content production methods for live events; tools for interactive digital broadcasters; new iTV video formats and services; and non-linear, multi-stream formats to stage live media events such as the 2008 Olympic Games.
- Overview of LIVE Work Package 6, Personalisation and Feedback
- Public overview of the objectives and results of work package 6, Personalisation and Feedback.
- LIVE: Bringing broadcasting to the next level
- Broadcasting is changing. With the advent of everything from set-top boxes to IPTV and 3DTV, what can we expect the viewing experience of the future to be like? One IST project which is gathering steam is LIVE, which could prove to be TV’s most interactive ‘real-time’ experience.
- Presentation of LIVE at the IST 2006
- This is an extract from the presentation given at the IST 2006 conference in Helsinki.
- Public Synopsis Initial LIVE Exploitation Plan (Deliverable 3.2)
- This document is a public synopsis of the first “Initial LIVE Exploitation Plan”, which is Part C of the overall Plan for Using and Disseminating Knowledge (PUDiK). This document is the first step in preparing the LIVE consortium for the exploitation of the project’s results; it provides a detailed work roadmap for Phases I and II of the project and some initial guidelines for Phase III (M37-45).
- Public Synopsis Interactive Digital Television (Deliverable 3.5)
- This Public Synopsis on Interactive Digital Television provides an overview of the iDTV market, including a short background introduction and an outline of interactive features. The document's sections cover the different types of existing services and solutions as well as market trends.
- Annual Public Report (D1.3)
- This document gives a report on the work and the results of the first year (2006) of the project.
- Public Synopsis on Basic System Architecture (D9.3)
- The goal of this public deliverable is to provide a high-level overview of the idea of the LIVE project and its basic system architecture. The description goes to the level of detail that is needed to understand the basic architecture. For more detailed descriptions, particularly of the subsystems, the reader is referred to the respective subsystem deliverables. The first basic system architecture described in this document incorporates the results of the first six months of research within the LIVE project. Derived from the basic idea of a system in which an interactive digital broadcaster should be able to create a non-linear multi-stream video show in real time that changes according to the consumers’ interests, first user tests were made and analyzed at the public Austrian broadcaster ORF (Österreichischer Rundfunk). These tests resulted in a set of initial requirements (compare deliverable D9.1 “Results from the initial requirement analysis”). Based on these requirements, actors of the LIVE System and their basic use cases were identified. This finally results in the basic system architecture which is briefly described in this deliverable. The target audience for this document is any person inside or outside of the LIVE project interested in learning about the proposed functionality and architecture of LIVE.
- Public Video iTV Technology (Deliverable 3.7)
- This report tries to identify the companies, markets and environments surrounding the iTV industry. The report classifies the main actors depending on the nature of their product or service technology. In addition to the short description of each technology area, this document also includes chapter subdivisions analysing the possible implications of technology for the LIVE project.
- State of the Art Report Intelligent Media Framework (Deliverable 7.1)
- The integrated project “LIVE Staging of Media Events” (LIVE; FP6-27312) aims at the creation of novel intelligent content production methods and tools for interactive digital broadcasters to stage live media events in the area of sports, such as the 2008 Olympic Games. This report presents the state of the art of the concepts, technologies and standards related to one of the core subsystems developed in the LIVE project: the “Intelligent Media Framework”, which provides a robust framework for the creation, management and delivery of so-called “Intelligent Media Assets” under real-time conditions. Topics covered in this report are: (1) selected technologies in the area of (semantic) media asset management, recommender systems, metadata generation systems, video conducting systems and interface technologies; (2) selected standards in the broadcasting domain and for knowledge representation; (3) derived architectural requirements as well as requirements for the content model of the envisaged Intelligent Media Framework; (4) an assessment and comparison of selected intelligent content models.
- End-user interactive view
- A poster demonstrating end-user interactive concepts of the LIVE system.
- LIVE press release at Salzburg Road Bicycle World Championships in September 2006
- German Version: A LIVE press release to inform the industry of LIVE's objectives and participation at the Salzburg Road Bicycle World Championships in September 2006. The LIVE project took advantage of the fact that its partner ORF was the host broadcaster of the UCI Road World Bicycle Championships in Salzburg to gain unique access to the live sporting production and broadcasting environment. This experience will be the basis for the first LIVE prototype testing phase in October 2006.
- LIVE press release at Salzburg Road Bicycle World Championships in September 2006
- A LIVE press release to inform the industry of LIVE's objectives and participation at the Salzburg Road Bicycle World Championships in September 2006. The LIVE project took advantage of the fact that its partner ORF was the host broadcaster of the UCI Road World Bicycle Championships in Salzburg to gain unique access to the live sporting production and broadcasting environment. This experience will be the basis for the first LIVE prototype testing phase in October 2006.
- Adding Lossless Video Compression to MPEGs
- In this correspondence, we propose to add a lossless compression functionality to existing MPEG standards by developing a new context tree to drive arithmetic coding for lossless video compression. In comparison with existing work on context tree design, the proposed algorithm features: 1) prefix sequence matching to locate the statistics model at the internal node nearest to the stopping point, where the successful match of the context sequence is broken; 2) traversing the context tree along a fixed order of context structure with a maximum number of four motion compensated errors; and 3) context thresholding to quantize the higher end of error values into a single statistics cluster. As a result, the proposed algorithm achieves competitive processing speed, low computational complexity and high compression performance, bridging the gap between universal statistics modeling and practical compression techniques. Extensive experiments show that the proposed algorithm outperforms JPEG-LS by up to 24% and CALIC by up to 22%, while the processing time ranges from less than 2 seconds per frame to 6 seconds per frame on a typical PC computing platform.
- Constrained Region-Growing and Edge Enhancement Towards Automated Semantic Video Object Segmentation
- Most existing object segmentation algorithms suffer from a so-called under-segmentation problem, where parts of the segmented object are missing and holes often occur inside the object region. This problem becomes even more serious when the object pixels have intensity values similar to those of the background. To resolve the problem, we propose constrained region-growing and contrast enhancement to recover the missing parts and fill in the holes inside the segmented objects. Our proposed scheme consists of three elements: (i) a simple linear transform for contrast enhancement to enable stronger edge detection; (ii) an 8-connected linking regional filter for noise removal; and (iii) constrained region-growing for elimination of the internal holes. Our experiments show that the proposed scheme is effective in resolving the under-segmentation problem, with a representative existing algorithm based on edge-map segmentation used as our benchmark.
- DCT-Domain Image Retrieval Via Block-Edge-Patterns
- A new algorithm for compressed image retrieval, based on DCT block edge patterns, is proposed in this paper. The algorithm directly extracts three edge patterns from compressed image data to construct an edge pattern histogram as an indexing key to retrieve images based on their content features. Three feature-based indexing keys are described: (i) the first two are represented by 3-D and 4-D histograms, respectively; and (ii) the third is constructed in the spirit of run-length coding, performed on consecutive horizontal and vertical edges. To test and evaluate the proposed algorithms, we carried out two-stage experiments. The results show that our proposed methods are robust to color changes and varied noise. In comparison with existing representative techniques, the proposed algorithms achieve superior performance in terms of retrieval precision and processing speed.
- An Effective and Fast Scene Change Detection Algorithm for MPEG Compressed Videos
- In this paper, we propose an effective and fast scene change detection algorithm that operates directly in the MPEG compressed domain. The proposed scene change detection exploits the MPEG motion estimation and compensation scheme by examining the prediction status of each macro-block inside B frames and P frames. As a result, both abrupt and dissolved scene changes are located by a sequence of comparison tests, and no feature extraction or histogram differentiation is needed. Therefore, the proposed algorithm can operate in the compressed domain and is suitable for real-time implementations. Extensive experiments illustrate that the proposed algorithm achieves up to 94% precision for abrupt scene change detection and 100% for gradual scene change detection. In comparison with similar existing techniques, the proposed algorithm achieves superior recall and precision rates.
- Video Indexing and Retrieval in Compressed Domain Using Fuzzy-Categorization
- There has been increased interest in video indexing and retrieval in recent years. In this work, the indexing and retrieval system for visual content is based on features extracted from the compressed domain. Direct processing in the compressed domain spares the decoding time, which is extremely important when indexing a large number of multimedia archives. A fuzzy-categorizing structure is designed in this paper to improve the retrieval performance. In our experiment, a database consisting of basketball videos has been constructed for our study. This database includes three categories: full-court match, penalty and close-up. First, spatial and temporal feature extraction is applied to train the fuzzy membership functions using the minimum entropy optimal algorithm. Then, the max composition operation is used to generate a new fuzzy feature to represent the content of the shots. Finally, the fuzzy-based representation becomes the indexing feature for the content-based video retrieval system. The experimental results show that the proposed algorithm is quite promising for semantic-based video retrieval.
- A fuzzy logic approach for detection of video shot boundaries
- Video temporal segmentation is normally the first and an important step for content-based video applications. Many features, including pixel difference, colour histogram, motion, and edge information, have been widely used and reported in the literature to detect shot cuts inside videos. Although existing research on shot cut detection is active and extensive, it still remains a challenge to achieve accurate detection of all types of shot boundaries with one single algorithm. In this paper, we propose a fuzzy logic approach to integrate hybrid features for detecting shot boundaries inside general videos. The fuzzy logic approach contains two processing modes, where one is dedicated to detection of abrupt shot cuts, including short dissolved shots, and the other to detection of gradual shot cuts. These two modes are unified by a mode-selector that decides which mode the scheme should work in to achieve the best possible detection performance. Using the publicly available test data set from Carleton University, extensive experiments were carried out, and the test results illustrate that the proposed algorithm outperforms representative existing algorithms in terms of precision and recall rates. © 2006 Pattern Recognition Society. Published by Elsevier Ltd. All rights reserved.
- EU-IST Project LIVE: Live Staging of Media Events
- Paper presented at the SAMT 2006: First International Conference on Semantics and Digital Media Technology, December 6 - 8, 2006, Athens, Greece.
- Methods, Design Guidelines and Workflows for Online Staging
- EU Information Society Technologies – FP6-27312, Report D4.3, EU-IST Project 'Live: Live Staging of Media Events', 2006. After describing the conceptual background necessary for the development of future live staging TV formats, this document proposes both visionary and initial concrete methods and design guidelines for online staging. In addition, considerations on the respective future workflows and the results of a first survey of suitable live video performance tools are presented.
- Identification of Dramaturgical Principles
- Identification of Dramaturgical Principles, Deliverable D2.2 of the MECiTV IST Research Project in FP5
- LIVE Newsletter Issue 1
- The aim of this publication is to report on the progress and results of the LIVE project as they happen. Each edition will briefly report on the progress of each work area as well as present a selection of articles from the consortium on a specific research theme being addressed in the project. This first edition provides an introductory overview of the nine main research and development work areas in the project, along with their respective research focus and expected outcomes.
- Series of LIVE postcards
- A series of 7 LIVE postcards promoting the main objectives of the project.