Prospectus

Audio Processing and Indexing

Course
2024-2025

Admission requirements

Not applicable.

Description

This seminar covers the fundamentals of audio processing and indexing. Applications in speech recognition and understanding, audio synthesis, and content-based audio retrieval will be discussed. State-of-the-art work on speech recognition, speech synthesis, and content-based audio retrieval will be studied and presented by the participants.

The seminar starts with several lectures and accompanying assignments in the form of workshops. These are followed by literature selection, study, and presentations by all students. The seminar ends with final project demos and presentations.

Course objectives

At the end of the seminar, the student is able to:

  • Explain and apply the fundamental methods of audio processing, audio indexing, speech synthesis, and speech recognition and understanding.

  • Apply basic audio processing algorithms to audio data and analyse and evaluate their performance.

  • Understand, analyse, evaluate, explain and discuss selected scientific research and experiments in the field of science and technology of audio processing, audio retrieval and spoken language processing.

  • Acquire, analyse and evaluate the necessary knowledge of state-of-the-art methods in the field of audio indexing and retrieval by studying scientific publications from journals and proceedings.

  • Create and design, implement, execute and report on a scientific audio processing or indexing experiment.

Timetable

The most recent timetable can be found at the Computer Science (MSc) student website.

In MyTimetable, you can find all course and programme schedules, allowing you to create your personal timetable. Activities for which you have enrolled via MyStudyMap will automatically appear in your timetable.

Additionally, you can easily link MyTimetable to a calendar app on your phone, and schedule changes will be automatically updated in your calendar. You can also choose to receive email notifications about schedule changes. You can enable notifications in Settings after logging in.

Questions? Watch the video, read the instructions, or contact the ISSC helpdesk.

Note: Joint Degree students from Leiden/Delft need to combine information from both the Leiden and Delft MyTimetables to see a complete schedule. This video explains how to do it.

Mode of instruction

  • Lectures

  • Seminar

  • Workshops

  • Presentations

  • Projects

  • Reports

Course load

Hours of study: 168 (= 6 EC)
Lectures: 26
Practical work: 62
Other: 80

Assessment method

Presentation (20% of the final grade), project (40% of the final grade), and 4 workshops (each 10% of the final grade, totalling 40% of the final grade).
The teacher will inform the students how the inspection of and follow-up discussion of the work will take place.
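
The weighting above can be checked with a short calculation. The following is only an illustrative sketch: the function name `final_grade` and the sample grades are hypothetical, and it assumes all components are graded on the same scale. Rounding rules and any minimum-grade requirements per component are not stated here, so they are left out.

```python
# Weights taken from the assessment description above.
# The function name and sample grades are hypothetical.
PRESENTATION_WEIGHT = 0.20
PROJECT_WEIGHT = 0.40
WORKSHOP_WEIGHT = 0.10  # per workshop
NUM_WORKSHOPS = 4


def final_grade(presentation, project, workshops):
    """Return the weighted final grade (no rounding applied)."""
    if len(workshops) != NUM_WORKSHOPS:
        raise ValueError(f"expected {NUM_WORKSHOPS} workshop grades")
    return (
        PRESENTATION_WEIGHT * presentation
        + PROJECT_WEIGHT * project
        + WORKSHOP_WEIGHT * sum(workshops)
    )


if __name__ == "__main__":
    # Hypothetical grades, for illustration only.
    print(round(final_grade(8.0, 7.0, [6.0, 7.0, 8.0, 9.0]), 2))
```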

Reading list

Lecture slides and further materials will be made available on the website of the course.

List of recommended books:

  • Fundamentals of Speech Recognition by Lawrence Rabiner and Biing-Hwang Juang (Hardcover, 507 pages; Publisher: Pearson Education POD; ISBN: 0130151572; 1st edition, April 12, 1993).

  • Theory and Applications of Digital Speech Processing by Lawrence Rabiner and Ronald Schafer (Publisher: Pearson; ISBN: 0-13-603428-4; 1st edition, 2011).

  • Automatic Speech Recognition: A Deep Learning Approach (Signals and Communication Technology) by Dong Yu and Li Deng, Springer; 2015 edition (November 11, 2014).

  • Deep Learning for NLP and Speech Recognition by Uday Kamath, John Liu, and James Whitaker (Springer, 2019).

Registration

As a student, you are responsible for enrolling on time through MyStudyMap.

In this short video, you can see step-by-step how to enrol for courses in MyStudyMap.
Extensive information about the operation of MyStudyMap can be found here.

There are two enrolment periods per year:

  • Enrolment for the fall opens in July

  • Enrolment for the spring opens in December

See this page for more information about deadlines and enrolling for courses and exams.

Note:

  • It is mandatory to enrol for all activities of a course that you are going to follow.

  • Your enrolment is only complete when you submit your course planning in the ‘Ready for enrolment’ tab by clicking ‘Send’.

  • Not being enrolled for an exam/resit means that you are not allowed to participate in the exam/resit.

Contact

Lecturer: dr. Erwin M. Bakker
Assistant: To be announced.
Website: Audio Processing and Indexing

Remarks

Software
Starting from the 2024/2025 academic year, the Faculty of Science will use the software distribution platform Academic Software. Through this platform, you can access the software needed for specific courses in your studies. For some software, your laptop must meet certain system requirements, which will be specified with the software. It is important to install the software before the start of the course. More information about the laptop requirements can be found on the student website.