In early or preparatory design stages, an architect or designer sketches out rough ideas, not only about the object or structure being considered, but its relation to its spatial context. This is an iterative process, where the sketches are not only the primary means for testing and refining ideas, but also for communicating among a design team and to clients. Hence, sketching is the preferred media for artists and designers during the early stages of design, albeit with a major drawback: sketches are 2D and effects such as view perturbations or object movement are not supported, thereby inhibiting the design process. We present an interactive system that allows for the creation of a 3D abstraction of a designed space, built primarily by sketching in 2D within the context of an anchoring design or photograph. The system is progressive in the sense that the interpretations are refined as the user continues sketching. As a key technical enabler, we reformulate the sketch interpretation process as a selection optimization from a set of context‐generated canvas planes in order to retrieve a regular arrangement of planes. We demonstrate our system (available at http:/geometry.cs.ucl.ac.uk/projects/2016/smartcanvas/ ) with a wide range of sketches and design studies. 相似文献
Designing text-to-speech systems capable of producing natural sounding speech segments in different Indian languages is a challenging and ongoing problem. Due to the large number of possible pronunciations in different Indian languages, a number of speech segments are needed to be stored in the speech database while a concatenative speech synthesis technique is used to achieve highly natural speech segments. However, the large speech database size makes it unusable for small hand held devices or human computer interactive systems with limited storage resources. In this paper, we proposed a fraction-based waveform concatenation technique to produce intelligible speech segments from a small footprint speech database. The results of all the experiments performed shows the effectiveness of the proposed technique in producing intelligible speech segments in different Indian languages even with very less storage and computation overhead compared to the existing syllable-based technique. 相似文献
Easy-to-use audio/video authoring tools play a crucial role in moving multimedia software from research curiosity to mainstream
applications. However, research in multimedia authoring systems has rarely been documented in the literature. This paper describes
the design and implementation of an interactive video authoring system called Zodiac, which employs an innovative edit history abstraction to support several unique editing features not found in existing commercial
and research video editing systems. Zodiac provides users a conceptually clean and semantically powerful branching history model of edit operations to organize the authoring process, and to navigate among versions of authored documents. In addition,
by analyzing the edit history, Zodiac is able to reliably detect a composed video stream's shot and scene boundaries, which facilitates interactive video browsing.
Zodiac also features a video object annotation capability that allows users to associate annotations to moving objects in a video sequence. The annotations themselves could
be text, image, audio, or video. Zodiac is built on top of MMFS, a file system specifically designed for interactive multimedia development environments, and implements an internal buffer
manager that supports transparent lossless compression/decompression. Shot/scene detection, video object annotation, and buffer
management all exploit the edit history information for performance optimization. 相似文献
As biometric authentication systems become more prevalent, it is becoming increasingly important to evaluate their performance. This paper introduces a novel statistical method of performance evaluation for these systems. Given a database of authentication results from an existing system, the method uses a hierarchical random effects model, along with Bayesian inference techniques yielding posterior predictive distributions, to predict performance in terms of error rates using various explanatory variables. By incorporating explanatory variables as well as random effects, the method allows for prediction of error rates when the authentication system is applied to potentially larger and/or different groups of subjects than those originally documented in the database. We also extend the model to allow for prediction of the probability of a false alarm on a "watch-list" as a function of the list size. We consider application of our methodology to three different face authentication systems: a filter-based system, a Gaussian mixture model (GMM)-based system, and a system based on frequency domain representation of facial asymmetry 相似文献
OBJECTIVES: The objectives were to measure the impact of specific features of imaging devices on tasks relevant to minimally invasive surgery (MIS) and to investigate cognitive and perceptual factors in such tasks. BACKGROUND: Although image-guided interventions used in MIS provide benefits for patients, they pose drawbacks for surgeons, including degraded depth perception and reduced field of view (FOV). It is important to identify design factors that affect performance. METHOD: In two navigation experiments, observers fed a borescope through an object until it reached a target. Task completion time and object shape judgments were measured. In a motion perception experiment, observers reported the direction of a line that moved behind an aperture. A motion illusion associated with reduced FOV was measured. RESULTS: Navigation through an object was faster when a preview of the object's exterior was provided. Judgments about the object's shape were more accurate with a preview (compared with none) and with active viewing (compared with passive viewing). The motion illusion decreased with a rectangular or rotating octagonal viewing aperture (compared with circular). CONCLUSIONS: Navigation performance may be enhanced when surgeons develop a mental model of the surgical environment, when surgeons (rather than assistants) control the camera, and when the shape of the image is designed to reduce visual illusions. APPLICATION: Unintentional contact between surgical tools and healthy tissues may be reduced during MIS when (a) visual aids permit surgeons to maintain a mental model of the surgical environment, (b) images are bound by noncircular apertures, and (c) surgeons manually control the camera. 相似文献
With the rapid growth of the availability and popularity of interpersonal and behavior-rich resources such as blogs and other social media avenues, emerging opportunities and challenges arise as people now can, and do, actively use computational intelligence to seek out and understand the opinions of others. The study of collective behavior of individuals has implications to business intelligence, predictive analytics, customer relationship management, and examining online collective action as manifested by various flash mobs, the Arab Spring (2011) and other such events. In this article, we introduce a nature-inspired theory to model collective behavior from the observed data on blogs using swarm intelligence, where the goal is to accurately model and predict the future behavior of a large population after observing their interactions during a training phase. Specifically, an ant colony optimization model is trained with behavioral trend from the blog data and is tested over real-world blogs. Promising results were obtained in trend prediction using ant colony based pheromone classier and CHI statistical measure. We provide empirical guidelines for selecting suitable parameters for the model, conclude with interesting observations, and envision future research directions. 相似文献
In this paper, we propose elitist genetic algorithms–based artificial neural network (ANN) model for setting up an early warning system for occurrence of high inflation. The proposed warning system uses values of an appropriate set of economic fundamental variables as input and builds an ANN model for quantifying the possibility of high inflation within a fixed period of time window. Elitism-based generational genetic algorithm is used for optimizing the architecture of the ANN model. We empirically evaluate the proposed neuro-genetic approach to identify the class of leading economic indicators and build an early warning signalling system of an occurrence of high inflation (overall and component inflations) using the data from the Indian economy. We further compare the results of the proposed approach with the commonly used data-driven signals approach. In the empirical studies, we observe promising performance of the proposed neuro-genetic warning system, which is capable of generating accurate early warning signals of an impending high inflation.
T. A. Stoffregen, L. J. Smart, B. G. Bardy, and R. J. Pagulayan (1999) combined a postural task (upright stance) with a suprapostural task (visual fixation) to show that sway variability was not driven by optic flow in a task-independent manner (autonomous control) but governed by the demands of the supra-postural task (facilitatory control). The present study used a novel combination of Stoffregen et al.'s task conditions but obtained clear evidence of autonomous control and no indication of facilitatory control. The theoretical adequacy of the stabilization-by-looking versus stabilization-of-looking contrast was examined, as was emerging evidence that posture control and common cognitive tasks place concurrent demands on the same capacity-limited resources. An adaptive resource-sharing view of postural-suprapostural multitasking was proposed as an alternative to both the autonomous- and facilitatory-control views. (PsycINFO Database Record (c) 2010 APA, all rights reserved) 相似文献
In situ nitridation during laser deposition of titanium–molybdenum alloys from elemental powder blends has been achieved by introducing the reactive nitrogen gas during the deposition process. Thus, Ti–Mo–N alloys have been deposited using the laser engineered net shaping (LENSTM) process and resulted in the formation of a hard α(Ti,N) phase, exhibiting a dendritic morphology, distributed within a β(Ti–Mo) matrix with fine scale transformed α precipitates. Varying the composition of the Ar + N2 gas employed during laser deposition permits a systematic increase in the nitrogen content of the as-deposited Ti–Mo–N alloy. Interestingly, the addition of nitrogen, which stabilizes the α phase in Ti, changes the solidification pathway and the consequent sequence of phase evolution in these alloys. The nitrogen-enriched hcp α(Ti,N) phase has higher c/a ratio, exhibits an equiaxed morphology, and tends to form in clusters separated by ribs of the Mo-rich β phase. The Ti–Mo–N alloys also exhibit a substantial enhancement in microhardness due to the formation of this α(Ti,N) phase, combining it with the desirable properties of the β-Ti matrix, such as excellent ductility, toughness, and formability. 相似文献