Y0250022
2 Introduction to audio and video processing
In design, especially when handling GUIs centered on audiovisual experiences, some form of audio or video processing is often
required. Without understanding the basic principles underlying such processing, it is difficult to apply methods appropriately
to a given purpose.
This course provides an overview of the fundamental data structures and processing principles for audio and video, with the
aim of cultivating the foundational techniques and applied skills necessary to carry out appropriate audiovisual processing.
Students will study the structures of audio, image, and video data as well as the basic principles of their processing. By
deepening understanding through hands-on practice, students will acquire fundamental skills in audiovisual processing.
- Explain the basic data structures of audio, images, and video.
- Explain the principles behind major processing methods for audio, images, and video.
- Perform appropriate processing on audio, images, and video in accordance with stated goals.
Relationship between 'Goals and Objectives' and 'Course Outcomes'
| | Tasks | Report | Presentation | Total |
|---|---|---|---|---|
| 1. | 10% | 10% | 10% | 30% |
| 2. | 10% | 10% | 10% | 30% |
| 3. | 30% | 10% | | 40% |
| Total | 50% | 30% | 20% | - |
| | Class schedule | HW assignments (Including preparation and review of the class.) | Amount of Time Required |
|---|---|---|---|
| 1. | How Sound Works: Observe the three elements of sound and wave properties to understand the relationship between sonic expression and perception. | Conduct the assigned task | 200 minutes |
| 2. | Sound and Visualization: “See” sound via composite waves to understand structure, overlap, and change. | Conduct the assigned task | 200 minutes |
| 3. | How Data Is Formed (1): Overview of how image and audio data are represented and compressed. | Conduct the assigned task | 200 minutes |
| 4. | How Data Is Formed (2): Understand data representations suited to their properties through contrasts such as raster vs. vector. | Conduct the assigned task | 200 minutes |
| 5. | Parameter Control: Manipulate video and audio by programming to learn the principles behind processing parameters. | Conduct the assigned task | 200 minutes |
| 6. | Data Acquisition and Conversion (1): Acquire audio/video data via microphones and cameras, perform conversion, and explore multimodal data processing. | Conduct the assigned task | 200 minutes |
| 7. | Data Acquisition and Conversion (2): Continue conversion workflows between audio and video data to deepen understanding of multimodal processing. | Conduct the assigned task | 200 minutes |
| Total | - | - | 1400 minutes |
Evaluation method and criteria
Each assignment will be evaluated on a four-level scale (S, A, B, F) with the following scores: S = 10, A = 8, B = 6, F =
0.
Each assignment carries a designated weight, and the final grade is calculated as the ratio of the student’s weighted total
score to the weighted total that would result if S were earned on all assignments.
A score rate of 60% or higher is required to pass.
The evaluation method may be revised as necessary according to class progress or changes in assignments; any revisions will
be explained in class.
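The calculation described above can be sketched as follows. This is only an illustration: the grade points (S = 10, A = 8, B = 6, F = 0) and the 60% pass threshold come from this syllabus, while the function names and the assignment weights in the example are hypothetical, since the actual weights are not listed here.

```python
# Grade points and pass threshold as stated in the syllabus.
GRADE_POINTS = {"S": 10, "A": 8, "B": 6, "F": 0}
PASS_RATE = 0.60


def score_rate(results):
    """Return the weighted score rate for a list of (grade, weight) pairs.

    The rate is the weighted total earned divided by the weighted total
    that would be earned if every assignment received an S.
    """
    earned = sum(GRADE_POINTS[grade] * weight for grade, weight in results)
    possible = sum(GRADE_POINTS["S"] * weight for _, weight in results)
    return earned / possible


def passes(results):
    """Return True when the score rate meets the pass threshold."""
    return score_rate(results) >= PASS_RATE


# Hypothetical example: five assignments with made-up weights.
example = [("S", 1), ("A", 2), ("B", 1), ("F", 1)]
rate = score_rate(example)  # (10 + 16 + 6 + 0) / 50
print(f"score rate: {rate:.0%}, pass: {passes(example)}")
```

With these assumed weights, the earned total is 32 out of a possible 50, so the score rate is 64% and the student passes.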
Feedback on exams, assignments, etc.
| Ways of feedback | Specific contents about "Other" |
|---|---|
| Feedback in the class | |
Textbooks and reference materials
Will be specified as appropriate during class.
Because the course primarily handles waveform data, students are strongly encouraged to acquire basic knowledge of the Fourier
transform in advance.
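As a small preview of why the Fourier transform matters for waveform data, the sketch below builds a composite wave from two sine components and recovers their frequencies and amplitudes with a naive discrete Fourier transform. It is not course material: the sample rate, the component frequencies, and all function names are assumptions chosen for illustration, and the pure-Python DFT is written for readability rather than speed.

```python
import cmath
import math

SAMPLE_RATE = 64  # samples per second (assumed for this example)
N = 64            # one second of samples, so DFT bin k corresponds to k Hz


def composite_wave(components, n=N, sample_rate=SAMPLE_RATE):
    """Sum sine waves given as (frequency_hz, amplitude) pairs."""
    return [
        sum(a * math.sin(2 * math.pi * f * t / sample_rate) for f, a in components)
        for t in range(n)
    ]


def dft(signal):
    """Naive O(N^2) discrete Fourier transform."""
    n = len(signal)
    return [
        sum(x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(signal))
        for k in range(n)
    ]


def amplitude_spectrum(signal):
    """Amplitude per frequency bin, for bins below the Nyquist frequency."""
    n = len(signal)
    spectrum = dft(signal)
    return [2 * abs(spectrum[k]) / n for k in range(n // 2)]


# Hypothetical composite wave: 5 Hz at amplitude 1.0 plus 12 Hz at amplitude 0.5.
components = [(5, 1.0), (12, 0.5)]
wave = composite_wave(components)
amps = amplitude_spectrum(wave)

# Bins whose amplitude stands clearly above numerical noise.
peaks = [k for k, a in enumerate(amps) if a > 0.1]
print(peaks)  # [5, 12]
```

The time-domain samples in `wave` show only a single irregular curve, while the amplitude spectrum separates it back into its two components. In practice an FFT routine from a numerical library would replace the naive `dft` function.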
Office hours and How to contact professors for questions
- Consultations are available before or after class. If necessary, please contact the instructor in advance by email to arrange
an appointment.
Non-regionally-oriented course
Development of social and professional independence
- Course that cultivates basic problem-solving skills
Most classes are interactive
Course by professor with work experience
| Work experience | Work experience and relevance to the course content if applicable |
|---|---|
| Applicable | The course is taught by a faculty member with professional experience in video and audio processing. |
Education related SDGs: the Sustainable Development Goals
- 9. INDUSTRY, INNOVATION AND INFRASTRUCTURE
- 12. RESPONSIBLE CONSUMPTION & PRODUCTION
Last modified : Mon Oct 20 04:04:01 JST 2025