🧠 DEEP LEARNING & COMPUTER VISION
📘 Buku Ajar Berbasis Proyek | Project-Based Textbook
📖 Deep Learning & Computer Vision
Buku Ajar untuk Satu Semester
Penulis: Tim Pengajar
Target: Mahasiswa tingkat akhir sarjana (programmer Python)
Durasi: 16 pertemuan (8 minggu efektif + UTS + UAS)
Bobot Penilaian: 100% (terlampir)
Halaman Judul
Deep Learning & Computer Vision
Buku Ajar Berbasis Proyek untuk Satu Semester
Kata Pengantar
Buku ini disusun untuk memenuhi kebutuhan mata kuliah Deep Learning dan Computer Vision dalam durasi satu semester (16 minggu). Materi disusun dari konsep dasar hingga penerapan di cloud dan edge computing, dengan pendekatan pembelajaran berbasis proyek (project-based learning). Setiap bab dilengkapi dengan tujuan pembelajaran, materi teori, panduan praktikum, serta instrumen penilaian. Buku ini juga mengintegrasikan penggunaan platform modern seperti TensorFlow, PyTorch, YOLO, AWS/GCP, dan Docker.
Daftar Isi
- Bagian 1: Pendahuluan dan Fundamental Deep Learning (Minggu 1–2)
- Bagian 2: Klasifikasi Citra dan Evaluasi Model (Minggu 3)
- Bagian 3: Object Detection dan Face Recognition (Minggu 4–5)
- Bagian 4: Cloud Computing, Deployment, dan Edge AI (Minggu 6–7)
- Bagian 5: Proyek End-to-End dan Studi Kasus (Minggu 8)
Daftar Pustaka & Lampiran: RPS Lengkap, Rubrik Penilaian, Kunci Jawaban
📌 BAGIAN 1: PENDAHULUAN DAN FUNDAMENTAL DEEP LEARNING
Minggu 1–2
Minggu 1: Konsep Dasar AI, Deep Learning, dan Computer Vision
Pertemuan 1 – Teori: Pengantar AI, DL, CV
1.1 Definisi Artificial Intelligence (AI)
AI adalah bidang ilmu komputer yang berfokus pada pembuatan mesin yang dapat meniru kecerdasan manusia. AI dibagi menjadi: Narrow AI (Weak AI) dan General AI (Strong AI).
1.2 Sejarah Deep Learning
1958 – Perceptron, 1986 – Backpropagation, 2012 – AlexNet, 2017 – Transformer.
1.3 Peran Computer Vision (CV)
Tugas utama CV: klasifikasi gambar, deteksi objek, segmentasi semantik, generasi gambar.
1.4 Use Case Industri
| Industri | Aplikasi | Teknologi |
|---|---|---|
| Kesehatan | Deteksi tumor CT scan | CNN klasifikasi |
| Retail | Self-checkout otomatis | Object detection |
| Otomotif | Mobil otonom | YOLO + segmentasi |
| Keamanan | Face recognition | FaceNet + embedding |
Tugas: Quiz awal – 10 soal pilihan ganda via LMS (Bobot 5%)
Pertemuan 2 – Praktikum: Neural Network dan CNN
Perceptron, Activation Function, Konvolusi, Pooling
import tensorflow as tf
from tensorflow.keras import layers, models
# NN untuk XOR gate
model = models.Sequential([
layers.Dense(4, activation='relu', input_shape=(2,)),
layers.Dense(1, activation='sigmoid')
])
model.compile(optimizer='adam', loss='binary_crossentropy', metrics=['accuracy'])
X = [[0,0],[0,1],[1,0],[1,1]]
y = [0,1,1,0]
model.fit(X, y, epochs=200, verbose=0)
print(model.predict([[0,1]]))
Tugas: Latihan coding – buat NN untuk logika AND (Bobot 5%)
Minggu 2: Pengolahan Data Citra dan Klasifikasi Dasar
Pertemuan 3 – Praktikum: Image Preprocessing
Resize, normalisasi, augmentasi, labeling. Contoh augmentasi:
from tensorflow.keras.preprocessing.image import ImageDataGenerator
datagen = ImageDataGenerator(rotation_range=20, horizontal_flip=True)
Tugas: Dataset terstruktur dengan augmentasi 5x lipat (Bobot 5%)
Pertemuan 4 – Praktikum: Image Classification dengan CNN
Arsitektur CNN: Conv2D + MaxPool + Flatten + Dense. Akurasi minimal 85% pada dataset kustom.
model = models.Sequential([
layers.Conv2D(32, (3,3), activation='relu', input_shape=(32,32,3)),
layers.MaxPooling2D((2,2)),
layers.Conv2D(64, (3,3), activation='relu'),
layers.MaxPooling2D((2,2)),
layers.Flatten(),
layers.Dense(64, activation='relu'),
layers.Dense(10, activation='softmax')
])
Bobot tugas: 5%
📌 BAGIAN 2: EVALUASI, OBJECT DETECTION, FACE RECOGNITION
Minggu 3: Evaluasi Model & Object Detection
Confusion Matrix, Precision, Recall, F1-Score, ROC-AUC. Juga YOLO, anchor box, IoU.
Minggu 4-5: Face Recognition & Object Detection Lanjut
Materi mencakup YOLO, SSD, dan FaceNet embedding.
📌 BAGIAN 3: CLOUD COMPUTING, DEPLOYMENT, DAN EDGE AI
Minggu 6: Cloud AI dan Deployment
Layanan Cloud: AWS Rekognition, GCP Vision AI, Azure Computer Vision.
| Provider | Layanan | Kemampuan |
|---|---|---|
| AWS | Rekognition | Deteksi wajah, teks |
| GCP | Vision AI | OCR, label detection |
| Azure | Computer Vision | Deskripsi gambar |
📌 BAGIAN 4: PROYEK END-TO-END DAN STUDI KASUS
Minggu 8: Pipeline End-to-End & Presentasi
Data pipeline, training pipeline, deployment pipeline. Studi kasus: Autonomous Vehicle, Surveillance, Retail Analytics.
Ujian Akhir: Presentasi dan demo proyek kelompok.
📖 DEEP LEARNING & COMPUTER VISION
One-Semester Textbook
Authors: Teaching Team
Target: Senior undergraduate students (Python programmers)
Duration: 16 sessions (8 effective weeks + Midterm + Final Exam)
Assessment Weight: 100% (see appendix)
Title Page
Deep Learning & Computer Vision
Project-Based One-Semester Textbook
Preface
This textbook is designed to meet the requirements of Deep Learning and Computer Vision courses within one semester (16 weeks). The materials progress from fundamental concepts to cloud and edge computing applications, using a project-based learning approach. Each chapter includes learning objectives, theoretical content, lab guides, and assessment instruments. The textbook also integrates modern platforms such as TensorFlow, PyTorch, YOLO, AWS/GCP, and Docker.
Table of Contents
- Part 1: Introduction and Deep Learning Fundamentals (Weeks 1–2)
- Part 2: Image Classification & Model Evaluation (Week 3)
- Part 3: Object Detection and Face Recognition (Weeks 4–5)
- Part 4: Cloud Computing, Deployment, and Edge AI (Weeks 6–7)
- Part 5: End-to-End Project and Case Studies (Week 8)
References & Appendices: Full Syllabus, Grading Rubric, Answer Keys
📌 PART 1: INTRODUCTION & DEEP LEARNING FUNDAMENTALS
Weeks 1–2
Week 1: Core Concepts of AI, Deep Learning, and Computer Vision
Session 1 – Theory: Introduction to AI, DL, CV
1.1 Definition of Artificial Intelligence (AI)
AI is the field of computer science focused on creating machines that can mimic human intelligence. AI is divided into Narrow AI (Weak AI) and General AI (Strong AI).
1.2 History of Deep Learning
1958 – Perceptron, 1986 – Backpropagation, 2012 – AlexNet, 2017 – Transformer.
1.3 Role of Computer Vision (CV)
Main CV tasks: image classification, object detection, semantic segmentation, image generation.
1.4 Industry Use Cases
| Industry | Application | Technology |
|---|---|---|
| Healthcare | CT scan tumor detection | CNN classification |
| Retail | Automated self-checkout | Object detection |
| Automotive | Autonomous vehicles | YOLO + segmentation |
| Security | Face recognition | FaceNet + embedding |
Assignment: Initial quiz – 10 multiple-choice questions via LMS (Weight 5%)
Session 2 – Lab: Neural Networks and CNN
Perceptron, Activation Functions, Convolution, Pooling
import tensorflow as tf
from tensorflow.keras import layers, models
# NN for XOR gate
model = models.Sequential([
layers.Dense(4, activation='relu', input_shape=(2,)),
layers.Dense(1, activation='sigmoid')
])
model.compile(optimizer='adam', loss='binary_crossentropy', metrics=['accuracy'])
X = [[0,0],[0,1],[1,0],[1,1]]
y = [0,1,1,0]
model.fit(X, y, epochs=200, verbose=0)
print(model.predict([[0,1]]))
Task: Coding exercise – build a NN for AND logic function (Weight 5%)
Week 2: Image Data Processing & Basic Classification
Session 3 – Lab: Image Preprocessing
Resize, normalization, augmentation, labeling. Augmentation example:
from tensorflow.keras.preprocessing.image import ImageDataGenerator
datagen = ImageDataGenerator(rotation_range=20, horizontal_flip=True)
Assignment: Structured dataset with 5x augmentation (Weight 5%)
Session 4 – Lab: Image Classification with CNN
CNN architecture: Conv2D + MaxPool + Flatten + Dense. Achieve at least 85% accuracy on custom dataset.
model = models.Sequential([
layers.Conv2D(32, (3,3), activation='relu', input_shape=(32,32,3)),
layers.MaxPooling2D((2,2)),
layers.Conv2D(64, (3,3), activation='relu'),
layers.MaxPooling2D((2,2)),
layers.Flatten(),
layers.Dense(64, activation='relu'),
layers.Dense(10, activation='softmax')
])
Assignment weight: 5%
📌 PART 2: EVALUATION, OBJECT DETECTION, FACE RECOGNITION
Week 3: Model Evaluation & Introduction to Object Detection
Confusion Matrix, Precision, Recall, F1-Score, ROC-AUC. Also YOLO, anchor boxes, IoU.
Weeks 4-5: Advanced Face Recognition & Object Detection
Content includes YOLO, SSD, and FaceNet embedding.
📌 PART 3: CLOUD COMPUTING, DEPLOYMENT & EDGE AI
Week 6: Cloud AI and Deployment
Cloud services: AWS Rekognition, GCP Vision AI, Azure Computer Vision.
| Provider | Service | Capabilities |
|---|---|---|
| AWS | Rekognition | Face detection, text |
| GCP | Vision AI | OCR, label detection |
| Azure | Computer Vision | Image description |
📌 PART 4: END-TO-END PROJECT & CASE STUDIES
Week 8: End-to-End Pipeline & Final Presentations
Data pipeline, training pipeline, deployment pipeline. Case studies: Autonomous Vehicle, Surveillance, Retail Analytics.
Final Exam: Group project presentation and demo.
Comments