High-speed kymography identifies the immediate effects of voiced vibration in healthy vocal folds

INTRODUCTION

Semi-occluded vocal tract exercises are used in the treatment of dysphonia and in vocal warm-ups by professional voice users. These exercises are performed with the aim of reducing and minimizing the collision stress caused by vocal fold vibrations (1). The voiced vibration technique is one exercise that has been widely used by speech therapists empirically (2, 3). Voiced vibrations are performed with fast and repeated oscillations of the lips or tongue by rapid passage of the expiratory airflow along with phonatory emission (4). The voiced vibration technique is indicated for most vocal disorders, and has attracted the attention of researchers over the last few years (3).

According to the literature, a greater nonlinear interaction between the vocal tract and glottal source and increased supra- and subglottic pressures exert positive effects on voiced vibrations by reducing the stress of voice production (1). The effects of vocal exercises are assessed by modern instrumentation for voice analisys (2). High-speed digital image-recording systems are the most current videoendoscopy methods used for voice studies. These systems allow reasearchers to observe the true vibrational patterns of the vocal folds (5, 6). High-speed kymography yields laryngeal images extracted from high-speed videoendoscopy, thereby allowing quantitative analyses of vocal-fold vibratory functions (5, 6, 7).

Analysis of digital kymographic images is performed through juxtaposition of the laryngeal images within the frames. This method a requires selection of the cutting position of the vocal fold images with the largest opening amplitude and subsequent measurement of the time phases, the opening and closing phases, the initial and latter phases of the vibratory cycle, and the amplitude of the vocal folds (5,6). This method is useful to investigate the effects of vocal exercises. The understunding of these effects is crucial in speech therapy; therefore, the goal of the present study was to identify the immediate effects of the voiced vibration exercise in healthy vocal folds by high-speed kymographic analysis.

METHOD

The study was approved by the Research Ethics Committee of the Federal University of São Paulo (UFSCar), protocol number 256/2010. All participants read and signed the Term of Free and Informed Consent, according to Resolution 196/96.

Participants

This was a prospective study conducted at the Voice Research Laboratory of the Hospital das Clínicas (HC-FMUSP) and School of Medicine, University of São Paulo. The inclusion criteria were the absence of medical reports for voice problems, no smoking and low alcohol consumption, no health problems during the voice testing, and normal perceptual quality of voice julged by a SLP voice specialist. Fifteen subjects (6 men and 9 women; ages ranging from 21 to 43 years) participated in this investigation.

Equipment

The data were extracted from high-speed videoendoscopy recordings before and after the voiced vibration exercises. Videoendoscopy data were acquired and stored by a system (Richard Wolf, Model HRES ENDOCAM 5562; Knittlingen, Germany) that acquires images at a rate of 4000 frames per second with a spatial resolution of 256x 256 pixels. A 90-degree rigid endoscope with a camera was used to record sequences of laryngeal images. The system is coupled to a computer and color video monitor (Figure 1).

The first high-speed videoendoscopy examination was performed before the vocal exercise. After the subject was seated in the examining chair, an endoscope was inserted into the subject's open mouth and advanced to the posterior pharynx. The participant was instructed to produce the vowel /E/ while the tongue was held by the clinician (Figure 1). Sustained phonation of the /E/ vowel was produced at normal pitch and loudness. The endoscope was adjusted such that it was parallel to the superior surface of the vocal folds. The recording time for each videoendoscopy session was 2 seconds (4000 frames/s).

Experimental Procedures

The next step was to perform the vocal exercise. Each participant was instructed by a speech therapist to produce fast and repetitive oscillations of the lips or tongue together with voice emission. The voiced vibrations were produced at habitual pitch and loudness. The female subjects performed the vocal exercise for 3 minutes, and the male subjects for 5 minutes (11).

The second high-speed videoendoscopy examination was performed after the voiced vibration exercise. In this evaluation, the rigid endoscope was maintained in the larynx at the same angle and distance as that used in the vocal exercise. During the data-collection procedure, the fundamental frequency and emission intensity were controlled. The fundamental frequency was controlled by a virtual keyboard of the SpeechPitch software and intensity was measured by a decibel meter (RadioShack, Model 33-2055; New York, USA) placed 30 cm from the labial commissure of the subject.

Analysis of High-Speed Kymograph Parameters

High-speed video recordings were converted into AVI files and processed by the software of the image recording system. The high-speed kymograph images were obtained by a line in the medial section of the laryngeal image for the capture and juxtaposition of the vocal fold images over time (Figure 2) (5). The midpoint was chosen due to its larger vocal fold mobility characteristics (12, 13). We developed a customize computational routine to quantify, in milliseconds, the phases of the vibratory cycle of the vocal folds. We quantified the closed, open, closing and opening phases, as well as, the closing, opening, and speed quotients (Figure 3). The closed quotient is an estimation of the fraction of time that the vocal folds are closed during each vibratory cycle. The opened quotient is an estimation of the time that the vocal folds are open during each vibratory cycle, and speed quotient is a relationship between the opening and closing times of the vocal folds (14).

Statistical analysis

The results of the analyses before and after of the vocal exercise were compared using the paired t test with a significance level of 0.05.

RESULTS

The results of the statistical analysis are presented in Figures 4, 5, and 6, and in Tables 1 and 2. There were statistical differences in the following parameters for female vocal folds: opened phase, closed phase, closing phase, and closing and opening quotients. Male vocal folds showed statistical differences in the speed quotient. Results show that opened and closing phases increase for female (p=0.05 and p=0.026, respectively). The closed phase decrease for female vocal folds (p=0.046). For female subjects the opening quotient increase (p=0.049) and decrease in the closing quotient (p=0.029). Male vocal folds showed a decrease in the speed quotient (p=0.048).

Figure 1. Laryngoscopic examination performed with the Richard Wolf equipment for high-speed image recordings. The laryngeal images are captured at a rate of 4000 frames per second. The equipment is composed of rigid endoscope (left) with camera (Endocam - 5562) coupled to a 300W light source (AUTO LP 5132) and a computer with color monitor (right) for image transmission and data analysis.

Figure 2. Top: Clipping of one vibration period from a high-speed recorded sequence of male healthy vocal folds. Bottom: High-speed kymography from high-speed recording of the sequence displayed at the top.

Figure 3. Kymographic image showing vibratory cycle phases. PTCV: total phase of a single vibratory cycle, FF: closed phase, FA: opened phase, Ff: closing phase, and Fa: opening phase.

Figure 4. Distribution of average values and standard deviation of the closed phase extracted before and after vibration exercise by high-speed kymography of female vocal folds.

Figure 5. Distribution of average values and standard deviation of the opened phase extracted before and after the vibration exercises by high-speed kymography of female vocal folds.

Figure 6. Distribution of average values and standard deviation of the closing phase extracted before and after the vibration exercises by high-speed kymography of female vocal folds.

DISCUSSION

Voiced vibration is a semi-occluded vocal tract exercise that allows vocalization with normal intensity and less mechanical trauma to tissues during phonation. This exercise directs the vocal fold oscillation condition, in which the vocal folds are slightly adduced by retroflex pressure in the vocal tract. Glottal airflow and vocal fold collision are then minimized (1). In this study, the goal was to identify the immediate effects of voiced vibrations in healthy vocal folds. Thus, we applied a high-speed kymograph analysis method. Our results corroborated the effect described by Titez (1). The vibratory patterns of the female vocal folds showed statistically significant differences. The results indicate soft contact between the female vocal folds due to decreased contact speed of the mucosa (P = 0.026* for the closing phase), and the time the vocal folds remained closed (P = 0.046* for the closed phase). Furthermore, there was an increase in the time the vocal folds remained open (P = 0.05* for the opened phase). The changes in the vibratory patterns of the male group suggest soft contact between vocal folds during phonation. The results point to a slow contact of the vocal fold mucosa (P = 0.08 for the closing phase), larger opening speed (P = 0.06 in the opening phase), and an increase in the time the glottis remained in the open position (P = 0.06 for the opened phase). However, there were no significant results for the males, indicating the need for further research with a larger group of subjects.

This is the first study of the effects of vocal techniques through kymographic parameters. Interestingly, our results agree with those of a videostroboscopic analysis (15). However, Pereira et al. study investigated female voices after applying a set of vocal techniques that included voiced vibrations. GASKILL and ERICKSON studied voiced vibrations by estimating the closing quotient on the basis of electroglottographic signals (EGG). The authors found a lower closing quotient during the voiced vibrations, concluding that for the glottic cycle to occur during voiced vibrations, the airflow has to be increased to sustain the vocal fold and lip oscillation (16). In our study, the closed quotient was extracted from the high-speed kymographs, and these results are in agreement with the conclusions presented by Gaskill and Erickson (16). The results can be interpreted to represent the changes in the vibratory pattern behavior of the vocal folds with reduced mechanical trauma of the tissues during the vocal exercise.

The pre- and post-exercise data for the quotients provided more support than the kymographic parameter data. In the female vocal folds, the effect produced by vocal exercise is evidenced by the decreased closing quotients (P = 0.029*), indicating the less time needed for the contact between vocal folds than for the entire glottic cycle. Furthermore, the average opening quotient increased (P = 0.049*), indicating an increase in the time the glottis remained in the open position throughout the glottic cycle. These relationships showed the expected voiced vibration effect; in other words, after the vocal exercise, the glottis remains open longer. In the male vocal folds, the expected effect of the exercise was indicated by the speed quotient (P = 0.048*), which showed a significant decrease in its average value after the voiced vibrations. This could be due to the increase in the speed of mucosal contact in the vocal folds. This behavior can be understood as an effect that prioritizes the slow closing and fast contact of the vocal folds to avoid the mechanical trauma collisions during phonation.

Paired t test with significance level 0.05.

CONCLUSIONS

This study allowed identification of the immediate effect of soft contact between the healthy vocal folds after the voiced vibration. The female vocal folds exhibited greater susceptibility to the influence of the vocal exercise. However, further research with a greater number of samples, mainly male samples, is necessary to corroborate the results of this paper.

High-speed kymography proved to be an efficient voice evaluation tool that could be used in other vocal exercise studies.

ACKNOWLEDGMENTS

To the Bio-Engineering Post graduate program - University of São Paulo - São Carlos. To the members of the Otolaryngology Department, School of Medicine, University of São Paulo and to the BiomedicalSignal Processing Laboratory of EESC/USP. This work was funded by a research grant from São Paulo's Research Foundation (FAPESP:2010/03345-0) and the financing of the High-Speed Videoendoscopy equipment (FAPESP: 09/511698-2).

REFERENCES

1. Titze IR. Voice Training and therapy with a semi-occluded vocal tract: rational and scientific underpinnings. J Speech Lang Hear Res. 2006;49:448-59.

2. Elliot N, Sundberg J, Gramming P. Physiological aspects of a vocal exercise. J Voice. 1997;11(2):171-7.

3. Azevedo LL, Passaglio KT, Rosseti MB, Silva CB, Oliveira BF, Costa RC. Avaliação da performance vocal antes e após a vibração sonorizada de língua. Rev Soc Bras Fonoaudiol. 2010;15(3):343-8.

4. Schwarz K, Cielo CA. Modificações laríngeas e vocais produzidas pela técnica de vibração sonorizada de língua. Pró-Fono. 2009;21(2):161-6.

5. Tsuji DH, Sennes LU. Videoquimografia de laringe: novo método de avaliação da vibração cordal. Arq Fund Otorrinolaringol. 1998;2(4):136-40.

6. Koishi HU, Tsuji DH, Imamura R, Sennes LU. Variação da intensidade vocal: estudo da vibração das pregas vocais em seres humanos com videoquimografia. Rev Bras Otorrinolaringol. 2003;69(4):464-70.

7. Yan Y; Damrose E; Bless D. Functional Analysis Of Voice Using Simultaneous High-Speed Imaging and Acoustic Recordings. J Voice. 2007;21(5):604-16.

8. Gasparini G, Behlau M. Quality of life: validation of the Brazilian version of the voice-related quality of life (V-RQOL) measure. J Voice. 2009;23(1):76-81.

9. Behlau M, Oliveira G, Santos La, Ricarte A. Validação no Brasil de protocolos de auto-avaliação do impacto de uma disfonia. Pró-Fono. 2009;21(4):326-32.

10. Yamazaki R, Leão SHS, Madazio G, Padovani M, Azevedo R, Behlau M. Correspondência entre escala analógico-visual e a escala numérica na avaliação perceptivo-auditiva das vozes. XVI Congresso Brasileiro de Fonoaudiologia, 2008, Campos do Jordão (SP).

11. Menezes MH, de Campos Duprat A, Costa HO. Vocal and laryngeal effects of voiced tongue vibration technique according to performance time. J Voice. 2005;19(1):61-70.

12. Kurita, S. Layer structure of the human vocal fold: morphological investigation. Otologia (Fukuoka). 1980;26:973-97.

13. Haji T, Isshiki N, Mori K, Omori K, Taira T, Honjo I. Experimental study of the mobility of the vocal fold mucosa. Folia Phoniatr. 1991;43(1):21-8.

14. Hirano M. Phonosurgery. Basic and clinical investigations. Otologia (Fukuoka) (Suppl 1) 1975;21:239-440.

15. Pereira EC, Silvério KCA, Marques JM, Camargo PAM. Efeito imediato de técnicas vocais em mulheres sem queixa vocal. Rev CEFAC. 2011;13(5):886-94.

16. Gaskill CS, Erickson ML. The effect of a voiced lip trill on estimated glottal closed quotient. Journal of Voice. 2008;22(6):634-43.

1) Master in Bio-Engineering. Doctorate. Post Graduation in Bio-Engineering - University of São Paulo - São Carlos.
2) Doctor in Sciences. Post-Doctorate. Department of Otolaryngology, School of Medicine, University of São Paulo.
3) Doctor in Sciences. Department of Otolaryngology, School of Medicine, University of São Paulo.
4) Master in Sciences of Otolaryngology. Doctorate. Department of Otolaryngology, School of Medicine, University of São Paulo.
5) Professor in the Department of Otolaryngology, School of Medicine, University of São Paulo.
6) Doctor in Sciences. Professor in the Department of Electric Engineering of University Federal of São Carlos.

Affiliation: Programa de Pós-graduação em Ciências na área de Bioengenharia da Escola de Engenharia de São Carlos, Faculdade de Medicina de Ribeirão Preto e Instituto de Química de São Carlos, Universidade de São Paulo - USP. São Carlos /SP - Brazil. Institution: University of São Paulo - Post Graduation in Bio-Engineering. São Carlos / SP - Brazil. Mailing address: Regina Aparecida Pimenta - João Ramalho Street - Jardim Centenário - São Carlos / SP - Brazil - Zip-code: 13564-090 - Telephone: (+55 16) 3373-8745 - E-mail: ginapimenta@usp.br.

Financial Support: This work was funded by a research grant from São Paulo's Research Foundation (FAPESP: 2010/03345-0) and the financing of the High-Speed Videoendoscopy equipment (FAPESP: 09/511698-2).

Article received in August 7, 2012. Article accepted on November 4, 2012.