COMP0197 CW1
COMP0197: Applied Deep Learning
Assessed Component 1 (Individual Coursework) 2023-24
Submission before 16:00 (UK time), 21st March 2024 (subject to change), on Moodle
Introduction
This is the first of two assessed courseworks. It accounts for 50% of the module and comprises three
independent tasks. For each task, a task script needs to be submitted together with other supporting files
and data. No separate written report is required.
There are hyperlinks in the document for further reference. Throughout this document, various parts of
the text are highlighted, for example:
Class names are highlighted for those mandatory classes that should be found in your submitted code.
Function names are highlighted for those mandatory functions that should be found in your submitted
code.
Printed messages on terminal when running the task scripts.
Visualisation saved into PNG files with task scripts.
[5]: square brackets indicate marks, with total marks being 100, for 50% of the module assessment.
“filepath.ext”: quotation marks indicate the names of files or folders.
commands: commands run on bash or Python terminals, given context.
The aim of the coursework is to develop and assess your ability a) to understand the technical and
scientific concepts behind deep learning theory and applications, b) to research the relevant methodology
and implementation details of the topic, and c) to develop the numerical algorithms in Python and one of
the deep learning libraries TensorFlow and PyTorch. Although the assessment does not place emphasis
on coding skills and advanced software development techniques, basic programming knowledge will be
taken into account, such as the correct use of NumPy arrays and tensors (as opposed to, for example,
unnecessary for-loops), sufficient commenting and consistent code format. Up to [20%] of the relevant
marks may be deducted for substandard programming practice.
Do NOT use this document for any other purposes or share with others. The coursework remains UCL
property as teaching materials. You risk breaching intellectual property regulations and/or committing
academic misconduct if you publish the details of the coursework or distribute this further.
Conda environment and Python packages
No external code (open-source or not) should be used for the purpose of this coursework. No other
packages should be used, unless specified and installed within the conda environment below. This will be
assessed by running the submitted code on the markers’ computers, within a conda environment created
as follows, for either TensorFlow or PyTorch. Make sure your OS is up-to-date to minimise potential
compatibility issues.
conda create -n comp0197-cw1-tf pillow=10.2 pip=19.3 && conda activate comp0197-cw1-tf && pip install tensorflow==2.13
conda create -n comp0197-cw1-pt -c pytorch python=3.12 pytorch=2.2 torchvision=0.17
Use one of the two for your coursework and indicate which with your submitted folder name, “cw1-tf” or
“cw1-pt”. Use the command conda list -n comp0197-cw1-xx to see the available libraries for this coursework
(“xx” is either “tf” or “pt”). You can choose to use either TensorFlow or PyTorch, but NOT both of them in
this coursework, as it is designed to have balanced difficulty across the different tasks. [100%] of the
relevant marks may be deducted for using external code.
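Before running a task script, it can help to confirm which framework the active environment actually provides. The following is a minimal sketch and not part of the required submission. The helper name `available_frameworks` is hypothetical, the package names "tensorflow" and "torch" are assumptions about how the two frameworks are exposed, and only the standard-library `importlib` is used.

```python
import importlib.util


def available_frameworks(names=("tensorflow", "torch")):
    """Report which of the given top-level packages can be imported.

    names: tuple of str, top-level package names to look for (assumed names).
    Returns a dict mapping each name to True if it is installed, else False.
    """
    # find_spec returns None when a top-level package is not installed.
    return {name: importlib.util.find_spec(name) is not None for name in names}


if __name__ == "__main__":
    for name, found in available_frameworks().items():
        print(f"{name}: {'available' if found else 'not found'}")
```

Running this inside the activated environment should list exactly one of the two frameworks as available if the environment was created with one of the commands above.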
Working directory and task script
Each task should have a task folder, named as “task1”, “task2” and “task3”. A Python task script should
be a file named as “task.py”, such that the script can be executed on a bash terminal when the task folder
is used as the current/working directory, within the conda environment described above:
python task.py
It is the individual’s responsibility to make sure the submitted task scripts can run in the above-specified
conda environment. If using data/code available in module tutorials, copies or otherwise automated links
need to be provided to ensure that the submitted code can be executed standalone. Take care to use
relative paths correctly, as this has been one of the most common issues in the past. Jupyter
Notebook files are NOT allowed. Up to [100%] of the relevant marks may be deducted if no runnable task
script is found.
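One way to reduce relative-path problems is to build every file path from a single base folder. The sketch below illustrates the idea with the standard-library `pathlib`. The helper name `resolve_in_task_folder` is hypothetical and is not one of the mandatory functions.

```python
from pathlib import Path


def resolve_in_task_folder(filename, script_file=None):
    """Build a path to `filename` inside the task folder.

    filename:    str, name of a file expected next to the task script.
    script_file: str or None, path of the running script (normally __file__);
                 when None, the current working directory is used instead.
    Returns a pathlib.Path pointing at the file.
    """
    if script_file is None:
        base = Path.cwd()
    else:
        # The folder that contains the script, independent of where it is run.
        base = Path(script_file).resolve().parent
    return base / filename
```

Inside “task.py”, a call such as `resolve_in_task_folder("result.png", __file__)` would then point into the task folder whether or not the script is started from there.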
Printing and visualisation
Summarising and communicating your implementation and quantitative results is assessed as part
of the module learning outcomes. Each task specifies relevant information and messages to be printed on
the terminal, which may contain a description, quantitative summary and brief remarks. The printed
messages are expected to be concise, accurate and clear.
When the task requires visualising results (usually in the form of an image), the code should save the results
into a PNG file in the respective working directory. These PNG files should be submitted with the code,
although they can be generated by the code as well. Please see examples in the module repository using
Pillow. Please note that matplotlib cannot be used in the task scripts but may be a good tool during
development. Up to [50%] of the relevant marks may be deducted if this is not followed.
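Where a visualisation is a montage of several images, the tiling itself can be done with array reshaping before the result is written to a PNG file. The following is a minimal sketch under stated assumptions: the function name `make_montage` is hypothetical, the images are assumed to be a uint8 array of shape (N, H, W, C), and NumPy is used only to keep the illustration short.

```python
import numpy as np


def make_montage(images, n_cols):
    """Tile a batch of images into a single grid image.

    images: uint8 array of shape (N, H, W, C), with N a multiple of n_cols.
    n_cols: int, number of images per row of the grid.
    Returns a uint8 array of shape (N // n_cols * H, n_cols * W, C).
    """
    n, h, w, c = images.shape
    n_rows = n // n_cols
    # Split the batch axis into (rows, cols), then interleave with H and W.
    grid = images.reshape(n_rows, n_cols, h, w, c).transpose(0, 2, 1, 3, 4)
    return grid.reshape(n_rows * h, n_cols * w, c)
```

With Pillow available in the environment, an array like this can be written to disk with `Image.fromarray(montage).save("result.png")`.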
Design your code
The functions/classes/files/messages highlighted (see Introduction) are expected to be found in your
submitted code, along with the task scripts. If not specifically required, you have freedom in designing
your own code, for example, data type, variables, functions, scripts, modules, classes and/or extra results
for discussion. These will be assessed for complementing your work but not for design aspects.
The checklist
Use this list to check your work before submission.
✓ The coursework will be submitted as a single “cw1-xx” folder, compressed as a single zip file.
✓ Under your “cw1-xx” folder, you should have three subfolders, “task1”, “task2” and “task3”.
✓ The task scripts run without needing any additional files, data or customised paths.
✓ All the classes and functions colour-coded in this document can be found with the exact names.
✓ Check that every function/class has a docstring giving a brief description of its purpose,
together with data type, size and what-it-is, for each input argument and output.
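As an illustration of the last checklist item, the sketch below shows one possible docstring layout that states purpose, data type, size and meaning for each argument and output. The function `rmse` is a hypothetical example rather than a mandatory function, and its body is written in plain Python only for brevity.

```python
import math


def rmse(predicted, target):
    """Compute the root-mean-square error between two sequences.

    Args:
        predicted: list of float, length N, model-predicted values.
        target: list of float, length N, reference values to compare against.

    Returns:
        float, scalar, the square root of the mean squared difference.
    """
    if len(predicted) != len(target):
        raise ValueError("predicted and target must have the same length")
    squared = [(p - t) ** 2 for p, t in zip(predicted, target)]
    return math.sqrt(sum(squared) / len(squared))
```

Any consistent docstring style should serve, as long as each argument and output is described.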
Task 1 Stochastic Minibatch Gradient Descent for Linear Models
• Implement a polynomial function polynomial_fun, that takes two input arguments, a weight vector 𝐰
of size 𝑀 + 1 and an input scalar variable 𝑥, and returns the function value 𝑦. The polynomial_fun
should be vectorised for multiple pairs of scalar input and output, with the same 𝐰. [5]
$$y = \sum_{m=0}^{M} w_m x^m$$
• Using the linear algebra modules in TensorFlow/PyTorch, implement a least-squares solver for fitting
the polynomial functions, fit_polynomial_ls, which takes 𝑁 pairs of 𝑥 and target values 𝑡 as input, with
an additional input argument to specify the polynomial degree 𝑀, and returns the optimum weight
vector 𝐰̂ in the least-squares sense, i.e. ‖𝑡 − 𝑦‖² is minimised. [5]
• Using relevant functions/modules in TensorFlow/PyTorch, implement a stochastic minibatch gradient
descent algorithm for fitting the polynomial functions, fit_polynomial_sgd, which has the same input
arguments as fit_polynomial_ls, with two additional input arguments, learning rate and
minibatch size. This function also returns the optimum weight vector 𝐰̂. During training, the function
should report the loss periodically using printed messages. [5]
• Implement a task script “task.py”, under folder “task1”, performing the following: [15]
o Use polynomial_fun (𝑀 = 2, 𝐰 = [1,2,3]ᵀ) to generate a training set and a test set, consisting
of 20 and 10 pairs of 𝑥 and 𝑡 respectively, with 𝑥 sampled uniformly from [−20, 20]. The
observed 𝑡 values are obtained by adding Gaussian noise (standard deviation being 0.5) to 𝑦.
o Use fit_polynomial_ls (𝑀 ∈ {2,3,4}) to compute the optimum weight vector 𝐰̂ using the
training set. For each 𝑀, compute the predicted target values 𝑦̂ for all 𝑥 in both the training
and test sets.
o Report, using printed messages, the mean (and standard deviation) in difference a) between
the observed training data and the underlying “true” polynomial curve; and b) between the
“LS-predicted” values and the underlying “true” polynomial curve.
o Use fit_polynomial_sgd (𝑀 ∈ {2,3,4}) to optimise the weight vector 𝐰̂ using the training set.
For each 𝑀, compute the predicted target values 𝑦̂ for all 𝑥 in both the training and test sets.
o Report, using printed messages, the mean (and standard deviation) in difference between the
“SGD-predicted” values and the underlying “true” polynomial curve.
o Compare the accuracy of the two methods against the ground truth on the test set and
report the root-mean-square errors (RMSEs) in both 𝐰 and 𝑦 using printed
messages.
o Compare the speed of the two methods and report time spent in fitting/training (in seconds)
using printed messages.
• Implement a task script “task1a.py”, under folder “task1”. [10]
o Experiment with how to make 𝑀 a learnable model parameter and use SGD to optimise this more
flexible model.
o Report, using printed messages, the optimised 𝑀 value and the mean (and standard deviation) in
difference between the model-predicted values and the underlying “true” polynomial curve.
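The vectorised evaluation and the least-squares fit described in Task 1 can be illustrated with a design matrix whose columns are the powers of 𝑥. The sketch below is an illustration only and not a reference solution. It uses NumPy rather than the required TensorFlow/PyTorch modules, the names `poly_eval` and `fit_ls` are hypothetical stand-ins for the mandatory functions, and the seed is chosen arbitrarily.

```python
import numpy as np


def poly_eval(w, x):
    """Evaluate y = sum_m w[m] * x**m for every entry of x at once.

    w: float array of shape (M + 1,), polynomial weights, lowest degree first.
    x: float array of shape (N,), input values.
    Returns a float array of shape (N,) with the polynomial values.
    """
    powers = np.arange(w.shape[0])            # exponents 0..M, shape (M + 1,)
    design = x[:, None] ** powers[None, :]    # design matrix, shape (N, M + 1)
    return design @ w


def fit_ls(x, t, degree):
    """Least-squares weights minimising ||t - design @ w||^2.

    x, t: float arrays of shape (N,); degree: int, polynomial degree M.
    Returns a float array of shape (degree + 1,).
    """
    design = x[:, None] ** np.arange(degree + 1)[None, :]
    w_hat, *_ = np.linalg.lstsq(design, t, rcond=None)
    return w_hat


rng = np.random.default_rng(0)                # fixed seed, chosen arbitrarily
w_true = np.array([1.0, 2.0, 3.0])
x_train = rng.uniform(-20.0, 20.0, size=20)
y_train = poly_eval(w_true, x_train)          # noise-free "true" curve
t_train = y_train + rng.normal(0.0, 0.5, size=20)

w_hat = fit_ls(x_train, t_train, degree=2)
diff = poly_eval(w_hat, x_train) - y_train
print(f"w_hat = {w_hat}")
print(f"difference to true curve: mean {diff.mean():.3f}, std {diff.std():.3f}")
```

Broadcasting `x[:, None]` against the exponent vector avoids any explicit loop over samples or degrees, which is the kind of vectorisation the introduction asks for.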
Task 2 A depth-wise separable convolution
For the purpose of the coursework, the dataset is split into only two sets: training and test.
• Adapt the Image Classification tutorial to use a different network, VisionTransformer. You can choose
any configuration that is appropriate for this application. [5]
o TensorFlow version
o PyTorch version
• Implement a data augmentation class MixUp, using the mixup algorithm, such that: [10]
o Inheriting from the relevant classes in TensorFlow/PyTorch is recommended but not assessed.
o The MixUp algorithm can be applied to images and labels in each training iteration.
o Have an input flag “sampling_method” and appropriate hyperparameters for two options:
▪ sampling_method = 1: λ is sampled from a beta distribution as described in the paper.
▪ sampling_method = 2: λ is sampled uniformly from a predefined range.
▪ The algorithm should be seeded for reproducible results.
o Visualise your implementation, by saving to a PNG file “mixup.png”, a montage of 16 images
with randomly augmented images that are about to be fed into network training.
o Note: the intention of this task is to implement the augmentation class from scratch using
only TensorFlow/PyTorch basic API functions. Using the built-in data augmentation classes
may result in losing all relevant marks.
• Implement a task script “task.py”, under folder “task2”, completing the following: [15]
o Train a new VisionTransformer classification network with MixUp data augmentation, for
each of the two sampling methods, with 20 epochs.
o Save the two trained models and submit your trained models within the task folder.
o Report the test set performance in terms of classification accuracy versus the epochs.
o Visualise your results, by saving to a PNG file “result.png”, a montage of 36 test images with
printed messages clearly indicating the ground-truth and the predicted classes for each.
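The mixup algorithm referred to above blends pairs of samples, x̃ = λ·xᵢ + (1 − λ)·xⱼ and ỹ = λ·yᵢ + (1 − λ)·yⱼ, with λ drawn from a Beta(α, α) distribution. The following is a minimal sketch of that blending step, written in NumPy for illustration rather than in TensorFlow/PyTorch. The function name `mixup_batch`, the value α = 0.4 and the use of a single λ per batch are assumptions made here, not requirements of the task.

```python
import numpy as np


def mixup_batch(images, labels_onehot, rng, alpha=0.4):
    """Blend each sample with a randomly chosen partner from the same batch.

    images:        float array of shape (N, H, W, C).
    labels_onehot: float array of shape (N, K), one-hot class labels.
    rng:           numpy.random.Generator, seeded by the caller.
    alpha:         float, Beta(alpha, alpha) parameter (value is an assumption).
    Returns the mixed images, the mixed labels and the sampled lambda.
    """
    lam = rng.beta(alpha, alpha)              # one lambda for the whole batch
    partner = rng.permutation(images.shape[0])
    mixed_images = lam * images + (1.0 - lam) * images[partner]
    mixed_labels = lam * labels_onehot + (1.0 - lam) * labels_onehot[partner]
    return mixed_images, mixed_labels, lam
```

Passing in a seeded generator keeps the sampling reproducible. For the second sampling option, the Beta draw could be replaced by `rng.uniform(low, high)` over a predefined range.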
Task 3 Ablation Study
Using the Image Classification tutorial, this task investigates the impact of the following modification to
the original network. To evaluate a modification, an ablation study can be used by comparing the
performance before and after the modification.
• Difference between training with the two λ sampling methods in Task 2.
• Implement a task script “task.py”, under folder “task3”, completing the following: [30]
o Randomly split the data into a development set (80%) and a holdout test set (20%).
o Randomly split the development set into training (90%) and validation (10%) sets.
o Design at least one metric, other than the loss, on validation set, for monitoring during
training.
o Train two models using the two different sampling methods.
o Report a summary of loss values, speed and metrics on the training and validation sets.
o Save and submit these two trained models within the task folder.
o Report a summary of loss values and the metrics on the holdout test set. Compare the results
with those obtained during development.
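The two-stage random split described in Task 3 can be sketched on sample indices, independently of how the images are stored. This is a minimal illustration using only the standard library. The function name `split_indices` and the default seed are hypothetical choices.

```python
import random


def split_indices(n_samples, seed=0):
    """Split sample indices into train, validation and holdout test sets.

    n_samples: int, total number of samples.
    seed:      int, seed for the shuffle (value chosen arbitrarily).
    Returns three lists of int: train, validation and test indices.
    """
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)

    n_dev = (n_samples * 8) // 10             # 80% development, 20% test
    dev, test = indices[:n_dev], indices[n_dev:]

    n_train = (len(dev) * 9) // 10            # 90% train, 10% validation
    train, val = dev[:n_train], dev[n_train:]
    return train, val, test
```

Fixing the seed keeps the holdout test set identical across the two trained models, so that their results remain comparable.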