Exploratory analysis and visualization of speech and music by locally linear embedding

Jain, Viren; Saul, Lawrence K

Exploratory analysis and visualization of speech and music by locally linear embedding

dc.contributor.author	Jain, Viren
dc.contributor.author	Saul, Lawrence K
dc.date	2023-05-16T21:27:01.000
dc.date.accessioned	2023-05-22T12:47:23Z
dc.date.available	2023-05-22T12:47:23Z
dc.date.issued	2004-05-17
dc.date.submitted	2004-07-27T10:20:26-07:00
dc.description.abstract	Many problems in voice recognition and audio processing involve feature extraction from raw waveforms. The goal of feature extraction is to reduce the dimensionality of the audio signal while preserving the informative signatures that, for example, distinguish different phonemes in speech or identify particular instruments in music. If the acoustic variability of a data set is described by a small number of continuous features, then we can imagine the data as lying on a low dimensional manifold in the high dimensional space of all possible waveforms. Locally linear embedding (LLE) is an unsupervised learning algorithm for feature extraction in this setting. In this paper, we present results from the exploratory analysis and visualization of speech and music by LLE.
dc.description.comments	Copyright © 2004 IEEE. Reprinted from Proceedings of the 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2004), held 17-24 May 2004, Montreal, Quebec, Canada. Publisher URL: http://ieeexplore.ieee.org/xpl/tocresult.jsp?isNumber=29345&page=17 This material is posted here with permission of the IEEE. Such permission of the IEEE does not in any way imply IEEE endorsement of any of the University of Pennsylvania's products or services. Internal or personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution must be obtained from the IEEE by writing to pubs-permissions@ieee.org. By choosing to view this document, you agree to all provisions of the copyright laws protecting it.
dc.identifier.uri	https://repository.upenn.edu/handle/20.500.14332/6339
dc.legacy.articleid	1002
dc.legacy.fulltexturl	https://repository.upenn.edu/cgi/viewcontent.cgi?article=1002&context=cis_papers&unstamped=1
dc.source.issue	3
dc.source.journal	Departmental Papers (CIS)
dc.source.peerreviewed	true
dc.source.status	published
dc.subject.other	voice recognition
dc.subject.other	speech recognition
dc.subject.other	audio processing
dc.subject.other	signal processing
dc.subject.other	pattern recognition
dc.subject.other	acoustics
dc.title	Exploratory analysis and visualization of speech and music by locally linear embedding
dc.type	Presentation
digcom.contributor.author	isAuthorOfPublication\|email:viren@seas.upenn.edu\|institution:University of Pennsylvania\|Jain, Viren
digcom.contributor.author	isAuthorOfPublication\|email:lsaul@cis.upenn.edu\|institution:University of Pennsylvania\|Saul, Lawrence K
digcom.identifier	cis_papers/3
digcom.identifier.contextkey	23479
digcom.identifier.submissionpath	cis_papers/3
digcom.type	conference
dspace.entity.type	Publication
relation.isAuthorOfPublication	f19b1679-63a3-46d2-93d4-be1317ccf2f4
relation.isAuthorOfPublication	a46f7bc2-dc15-4b9e-a854-f0055d4d4a2f
relation.isAuthorOfPublication.latestForDiscovery	f19b1679-63a3-46d2-93d4-be1317ccf2f4
upenn.schoolDepartmentCenter	Departmental Papers (CIS)

Files

Original bundle

Now showing 1 - 1 of 1

Name:: jainsaul.pdf
Size:: 1.52 MB
Format:: Adobe Portable Document Format

Download

Collection

Presentations

Exploratory analysis and visualization of speech and music by locally linear embedding

Files

Original bundle

Collection

Usage statistics

Penn's Heritage