Similar literature
 20 similar documents found
1.
The authors tried to establish quantitative and qualitative acoustic parameters of a good voice, suitable for future voice professionals. In their work they used long-term average spectrum analysis (LTAS) and three-dimensional analysis of periodicity (3D-PAN). They consider the regression straight line of formant regions and the parameters offered by 3D-PAN, jitter first of all, as the main acoustic parameters for the evaluation of voice quality and draw attention to the fact that acoustic parameters represent only one part of the evaluation of voice quality.

2.
The research described in this article had 2 aims: to permit greater precision in the conduct of naming experiments and to contribute to a characterization of the motor execution stage of speech production. The authors report an exhaustive inventory of consonantal and postconsonantal influences on delayed naming latency and onset acoustic duration, derived from a hand-labeled corpus of single-syllable consonant-vowel utterances. Five talkers produced 6 repetitions each of a set of 168 prepared monosyllables, a set that comprised each of the consonantal onsets of English in 3 vowel contexts. Strong and significant effects associated with phonetic characteristics of initial and noninitial phonemes were observed on both delayed naming latency and onset acoustic duration. Results are discussed in terms of the biomechanical properties of the articulatory system that may give rise to these effects and in terms of their methodological implications for naming experiments. (PsycINFO Database Record (c) 2011 APA, all rights reserved)

3.
The authors examined the interaction of acoustic and lexical information in lexical access and segmentation. The cross-modal lexical priming technique was used to determine which word meanings listeners access at the offsets of oronyms (e.g., tulips or two lips) presented in connected speech. In Experiment 1, participants showed priming by the meaning of tulips when presented with two lips. In Experiment 2, priming by the meaning of the 2nd word was found in such sequences (e.g., lips in two lips). Finally, Experiment 3 demonstrated that listeners do not show priming by lips when it is pronounced as part of tulips. The results of these experiments show that listeners sometimes access words other than those intended by speakers and may simultaneously access words associated with several parses of ambiguous sequences. Furthermore, the results suggest that acoustic marking of word onsets places constraints on the success of lexical access. To account for these results, the authors propose a new model of lexical access and segmentation. (PsycINFO Database Record (c) 2011 APA, all rights reserved)

4.
This study reports 4 experiments that investigated the locus of temporal effects of printed word frequency in speeded-naming tasks. Response latencies and onset durations are shorter for high-frequency words compared with low-frequency words, but there is no effect of frequency on rime durations. These results can only be accounted for if (a) phonemes are activated in parallel and not sequentially from left to right and (b) the criterion to initiate pronunciation is based on the initial phoneme and not the whole word. In addition, the effect of word-initial phoneme characteristics on acoustic latency was investigated. The acoustic latency of words beginning with voiceless sibilants was less than that of words beginning with plosives, a pattern opposite that reported by R. Treiman, J. Mullennix, R. Bijeljac-Babic, and E. E. Richmond-Welty (1995). This difference was attributed to the lower sensitivity of voice keys compared with measures based on digitized responses. (PsycINFO Database Record (c) 2010 APA, all rights reserved)

5.
Sound onsets are salient and behaviorally relevant, and most auditory neurons discharge spikes locked to such transients. The acoustic parameters of sound onsets that shape such onset responses are unknown. This paper analyzes the timing of spikes of single neurons in the primary auditory cortex of barbiturate-anesthetized cats in response to the onsets of tone bursts. By parametric variation of sound pressure level, rise time, and rise function (linear or cosine-squared), the time courses of peak pressure, rate of change of peak pressure, and acceleration of peak pressure during the tones' onsets were systematically varied. For cosine-squared rise function tones of a given frequency and laterality, any neuron's mean first-spike latency was an invariant and inverse function of the maximum acceleration of peak pressure occurring at tone onset. For linear rise function tones, latency was an invariant and inverse function of the rate of change of peak pressure. Thus latency is independent of rise time or sound pressure level per se. Latency-acceleration functions, obtained with cosine-squared rise function tones under different stimulus conditions (frequency, laterality) from any given neuron and across the neuronal pool, were of strikingly similar shape. The same was true for latency-rate of change of peak pressure functions obtained with linear rise function tones. Latency-acceleration/rate of change of peak pressure functions could differ in their extent and in their position within the coordinate system. The positional differences reflect neuronal differences in minimum latency Lmin and in a sensitivity S to acceleration and rate of change of peak pressure (transient sensitivity), a hitherto unrecognized neuronal property that is distinctly different from firing threshold. Estimates of Lmin and S, which were derived by fitting a simple function to the neuronal latency-acceleration/rate of change of peak pressure functions, were independent of rise function.
On average, Lmin decreased with increasing characteristic frequency (CF), but varied widely for neurons with the same CF. S varied with CF in a fashion similar to the cat's audiogram and, for a given neuron, varied with frequency. SD of first-spike latency was roughly proportional to the slope of the functions relating latency to acceleration/rate of change of peak pressure. Thus SD increased exponentially, rather than linearly, with mean latency, and did so at about twice the rate for linear as for cosine-squared rise function tones. The proportionality coefficients were quite similar across the neuronal pool and similar for both rise functions. Minimum SD increased nonlinearly with increasing Lmin. These findings suggest a peripheral origin of S and a peripheral establishment of latency-acceleration/rate of change of peak pressure functions. Because of the striking similarity in the shapes of such functions across the neuronal pool, sound onsets will produce orderly and predictable spatiotemporal patterns of first-spike timing, which could be used to instantaneously track rapid transients and to represent transient features by partly scale-invariant temporal codes.

6.
Languages are known to exhibit universal restrictions on sound structure. The source of such restrictions, however, is contentious: Do they reflect abstract phonological knowledge, or properties of linguistic experience and auditory perception? We address this question by investigating the restrictions on onset structure. Across languages, onsets of small sonority distances are dispreferred (e.g., lb is dispreferred to bn). Previous research with aural materials demonstrates that such preferences modulate the perception of unattested onsets by English speakers: Universally ill-formed onsets are systematically misperceived (e.g., lba → leba) relative to well-formed onsets (e.g., bn). Here, we show that the difficulty in processing universally ill-formed onsets extends to printed materials. Auxiliary tests indicate that such difficulties reflect phonological, rather than orthographic, knowledge, and regression analyses demonstrate that such knowledge goes beyond the statistical properties of the lexicon. These findings suggest that speakers have abstract, possibly universal, phonological knowledge that is general with respect to input modality. (PsycINFO Database Record (c) 2010 APA, all rights reserved)

7.
A new method is presented for the parameterization of glottal volume velocity waveforms that have been estimated by inverse filtering acoustic speech pressure signals. The new technique, Parameter for Spectral and Amplitude Features of the Glottal Flow (PSA), combines two features of voice production, the AC value and the spectral decay of the glottal flow, both of which contribute to changes in vocal loudness. PSA yields a single parameter that characterizes the glottal flow in different loudness conditions. By analyzing voices of 8 speakers it was shown that the new parameter correlates strongly with the sound pressure level of speech.

8.
The acoustic confusion effect is the finding that lists of to-be-remembered items that sound similar to one another are recalled worse than otherwise comparable lists of items that sound different. Previous work has shown that concurrent irrelevant speech and concurrent irrelevant tapping both reduce the size of this effect, suggesting similarities between the two manipulations. The authors assessed the relation between irrelevant speech and irrelevant tapping by correlating the disruption each causes to recall of similar- and dissimilar-sounding items. A significant correlation was obtained, indicating a relation between the two. The results indicate that researchers should be sensitive to changes in the magnitude of the effects rather than focusing exclusively on the presence or absence of particular effects. Implications for the 3 major explanations of the irrelevant speech effect are discussed. (PsycINFO Database Record (c) 2010 APA, all rights reserved)

9.
The aims of this study were to investigate the adequacy of electronic voice keys for the purpose of measuring naming latency and to test the assumption that voice key error can be controlled by matching conditions on initial phoneme. Three types of naming latency measurements (hand-coding and 2 types of voice keys) were used to investigate effects of onset complexity (e.g., sat vs. spat) on reading aloud (J. R. Frederiksen & J. F. Kroll, 1976; A. H. Kawamoto & C. T. Kello, 1999). The 3 measurement techniques produced the 3 logically possible results: a significant complexity advantage, a significant complexity disadvantage, and a null effect. Analyses of the performance of each voice key are carried out, and implications for studies of naming latency are discussed. (PsycINFO Database Record (c) 2010 APA, all rights reserved)

10.
BACKGROUND: Following total laryngectomy the voice is produced by esophageal speech as well as with voice prostheses by vibrations of pharyngeal mucosal folds. This pharyngeal sound normally has a significantly lower fundamental frequency than the healthy voice (men about 120 Hz, women about 240 Hz, pharyngeal voice about 70 Hz), which is a handicap especially for female laryngectomy patients. In order to improve the postlaryngectomy voice, a new type of voice prosthesis containing an integrated sound-producing metallic reed element was developed (ADEVA Company, Lübeck, Germany). METHODS/PATIENTS: Thirty-five of these new sound-producing voice prostheses were tested in vitro for different prosthesis-specific physical parameters (pressure, flow, sound pressure, flow resistance, frequency range). In 15 voice prosthesis speakers, a sound-producing prosthesis was introduced during a routine outpatient visit. Besides measurement of the above-mentioned physical parameters in patients with conventional and sound-producing prostheses, the resulting voice was also evaluated by means of a video recording. RESULTS: In vitro all prostheses with the metallic reed element produced a clear sound. Flow resistance of the prostheses was slightly elevated by the reed element. Insertion of the prostheses was hindered by the reed element. The period of uninterrupted sound production was prolonged after insertion of a sound-producing prosthesis and patients could speak at a lower pressure level, but the sound of the reed element was permanently distinguishable in only 6 of 15 patients. CONCLUSIONS: In principle a variation of the pharyngeal voice by means of a sound-producing element, which is integrated into a voice prosthesis, is possible. The current design of the metallic reed element tested is not yet suitable for routine clinical use: 1. The reed element is too sensitive and is easily damaged during insertion, so the insertion device has to be improved. 2. The sound-producing element is blocked by small amounts of tracheal secretions, so this element should be replaceable separately without requiring removal of the silicone valve (if possible by the patient himself). Prior to insertion of the sound-producing voice prosthesis the maximum air flow through the shunt should be measured to determine whether the patient can produce the air flow necessary to activate the reed element. A further improvement for these special types of voice prostheses would be a sound-producing element that generates a variable frequency of sound. Limiting the patient to only one fundamental frequency creates a monotone, which does not sound natural. Initial progress toward a sound-producing voice prosthesis has been made. This should be followed by the necessary improvements to make this design feasible for routine clinical use.

11.
Following curative minimally invasive laser resection of T1-T3 laryngeal carcinomas, patients underwent intensive voice rehabilitation. Therapy was administered twice a day for approximately 2 months, utilizing the concept of functional voice therapy. Before and after rehabilitation, acoustic analyses were made by using two different computer-supported measuring systems: (1) the "multidimensional voice program" (Kay Elemetrics Corp.) and (2) a novel software program developed in Göttingen that includes a new voice quality parameter based on correlations between frequency bands. Acoustic analyses showed superiority of the glottal versus supraglottal compensatory phonation. Findings showed that not all acoustic parameters equally documented voice improvement after rehabilitation. The standard deviation of fundamental frequency was the only parameter showing a significant post-therapeutic improvement. A further suitable acoustic method proved to be the voice quality parameter that has been newly introduced by us. In contrast to analogous parameters of other methods, this approach is independent of the exact periodicity of the glottal excitation function, thus permitting reliable results to be obtained even with aphonic or heavily dysphonic voices.

12.
When people hear a sound (a “sound object” or a “sound event”) the perceived auditory space around them might modulate their emotional responses to it. Spaces can both affect the acoustic properties of the sound event itself and impose boundaries on the actions one can take with respect to this event. Virtual acoustic rooms of different sizes were used in a subjective and psychophysiological experiment that evaluated the influence of auditory space perception on emotional responses to various sound sources. Participants (N = 20) were exposed to acoustic spaces with sound source positions and room acoustic properties varying across the experimental conditions. The results suggest that, overall, small rooms were considered more pleasant, calmer, and safer than big rooms, although this effect of size seems to disappear when listening to threatening sound sources. Sounds heard behind the listeners tended to be more arousing, and elicited larger physiological changes, than sources in front of the listeners. These effects were more pronounced for natural, compared to artificial, sound sources, as confirmed by subjective and physiological measures. (PsycINFO Database Record (c) 2010 APA, all rights reserved)

13.
Individuals with normal voice and patients with functional voice impairments underwent electrophysiological investigation of various parts of the hearing system, using tone audiometry, including the extended frequency band (10, 12, 14 and 16 kHz), as well as short- and long-latency acoustic evoked potentials (SLAEP and LLAEP). It was found that individuals with functional voice impairments had all parts of their hearing system impaired to various extents, with more marked impairments in the central than in the peripheral part of the hearing system. It was shown that hearing at 4-8 kHz and in the extended frequency band, especially at 14-16 kHz, together with the time patterns of acoustic evoked potentials (latencies of waves III and V of SLAEP, the interpeak interval I-V, and the latency periods of the LLAEP components P2 and N2), could be useful in the professional selection of individuals for voice and speech professions and for solving labor expertise matters. Of those individuals with normal voice but systematic vocal stress, 17.5% had impaired hearing at 14 and 16 kHz, as well as significant latency prolongation of the LLAEP wave N2 with tone stimulation at 1 and 4 kHz. Apparently, individuals in voice and speech professions should be regarded as a "risk" group. It may well be that extended band audiometry and the time patterns of acoustic evoked potentials could be useful in determining the thresholds between normality and pathology in voice dysfunctions.

14.
The consistent, but often wrong, impressions people form of the size of unseen speakers are not random but rather point to a consistent misattribution bias, one that the advertising, broadcasting, and entertainment industries also routinely exploit. The authors report 3 experiments examining the perceptual basis of this bias. The results indicate that, under controlled experimental conditions, listeners can make relative size distinctions between male speakers using reliable cues carried in voice formant frequencies (resonant frequencies, or timbre) but that this ability can be perturbed by discordant voice fundamental frequency (F0, or pitch) differences between speakers. The authors introduce 3 accounts for the perceptual pull that voice F0 can exert on our routine (mis)attributions of speaker size and consider the role that voice F0 plays in additional voice-based attributions that may or may not be reliable but that have clear size connotations. (PsycINFO Database Record (c) 2010 APA, all rights reserved)

15.
V Wolfe, D Martin. Canadian Metallurgical Quarterly, 1997, 30(5): 403-15; quiz 415-6
The purpose of this study was to explore the acoustic discrimination and graded severity of three clinical voice types. Listeners classified 102 samples of dysphonic vowels /a/ and /i/ on the basis of voice types: breathy, hoarse, and strained. The vowels were analyzed acoustically with two measures of perturbation and two measures of spectral noise. Discriminant analysis showed that a priori acoustic classifications of voice type were made with 92% accuracy using four acoustic parameters: (a) cepstral peak prominence (CPP), (b) jitter standard deviation (SD-J), (c) fundamental frequency (F0), and (d) standard deviation of signal-to-noise ratio (SD-SNR). Findings suggest that voice type is associated with the interaction of spectral noise, fundamental frequency, and signal irregularity, and that dysphonic severity is associated with similar parameters, regardless of voice type.

16.
17.
Analysis of acoustic interactions between animals in active choruses is complex because of the large numbers of individuals present, their high calling rates, and the considerable numbers of vocalizations that either overlap or show close temporal alternation. The authors describe a methodology for recording chorus activity in bullfrogs (Rana catesbeiana) using multiple, closely spaced acoustic sensors that provide simultaneous estimates of sound direction and sound characteristics. This method provides estimates of location of individual callers, even under conditions of call overlap. This is a useful technique for understanding the complexity of the acoustic scene faced by animals vocalizing in groups. (PsycINFO Database Record (c) 2010 APA, all rights reserved)

18.
It is well established that subjective judgments of perceived urgency of alarm sounds can be affected by acoustic parameters. In this study, the authors investigated an objective measurement, the reaction time (RT), to test the effectiveness of temporal parameters of sounds in the context of warning sounds. Three experiments were performed using a RT paradigm, with two different concurrent visuomotor tracking tasks simulating driving conditions. Experiments 1 and 2 show that RT decreases as interonset interval (IOI) decreases, where IOI is defined as the time elapsed from the onset of one sound pulse to the onset of the next. Experiment 3 shows that temporal irregularity between pulses can capture a listener's attention. These findings lead to concrete recommendations: IOI can be used to modulate warning sound urgency, and temporal irregularity can provoke an arousal effect in listeners. The authors also argue that the RT paradigm provides a useful tool for clarifying some of the factors involved in alarm processing. (PsycINFO Database Record (c) 2010 APA, all rights reserved)
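
The IOI definition in the abstract above (time from the onset of one pulse to the onset of the next) can be illustrated with a minimal sketch. The function name and the onset times are hypothetical and are not taken from the study; the sketch only assumes that pulse onsets are available as a sorted list of times.

```python
def interonset_intervals(onsets):
    """Return the interonset intervals (IOIs) for a sorted list of pulse onset times.

    Each IOI is the onset time of one pulse subtracted from the onset time
    of the next pulse, in the same unit as the input.
    """
    return [later - earlier for earlier, later in zip(onsets, onsets[1:])]


# Hypothetical onset times in milliseconds, for illustration only.
regular_onsets_ms = [0, 150, 300, 450]
irregular_onsets_ms = [0, 100, 350, 450]

print(interonset_intervals(regular_onsets_ms))
print(interonset_intervals(irregular_onsets_ms))
```

A constant list of IOIs corresponds to a temporally regular pulse train, whereas varying IOIs correspond to the kind of temporal irregularity the abstract mentions.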

19.
Contends that in the literature on the vocal expression of emotion, there is a discrepancy between reported high accuracy in vocal-auditory recognition and a lack of evidence for the acoustic differentiation of vocal expression. The latter is explained by (a) a paucity of research on voice quality, (b) neglect of the social signaling functions of affect vocalization, and (c) insufficiently precise conceptualization of the underlying emotional states. A component-patterning model of vocal affect expression is proposed that attempts to link the outcomes of antecedent event evaluation to biologically based response patterns. The likely phonatory and articulatory correlates of the physiological responses characterizing different emotional states are described in the form of 3 major voice types (narrow/wide, lax/tense, full/thin). Specific predictions about changes in acoustic parameters resulting from changing voice types are compared with the pattern of empirical findings yielded by a comprehensive survey of the literature on vocal cues in emotional expression. Although the comparison is largely limited to the lax/tense voice type (because acoustic parameters relevant to the other voice types have not yet been systematically studied), a high degree of convergence is revealed. (120 ref) (PsycINFO Database Record (c) 2010 APA, all rights reserved)

20.