Students will (1) understand file system organization, (2) understand and use both absolute and relative names for files and folders, and (3) be experienced in using the command line to create, move, rename, copy, and delete files and folders.
Read my notes on the Command Line.
You’ve heard that data scientists are in demand and would like to practice some skills used by these power users, such as—no kidding—cleaning up bad data. You also like movies, and have found a messed up data set. How perfect! You can use your film knowledge while developing one of the signature talents of power users: the command line.
As your job is to fix up some messy data, I’ve provided you the bad data set in a GitHub repository that you will import. At GitHub, select Import Repository. Then:
The import might take a few minutes. When done, clone the repo, change to your working directory, and get ready to work.
Browse the folders and files in the repo and note the problems. Put everything right, using only the command line, and keeping track of the commands you use to fix things up. You may use your computer’s file system browser (File Manager, Explorer, etc.) to observe your activity, but you may not use the browser to create, move, rename, copy, delete, or otherwise modify the file system. Your cleaned data must:
curl
commandRecord the sequence of commands you used to organize the files; you’ll be responsible for listing the commands as part of the required submission for this lab (instructions are below). Don’t worry about efficiency—you should feel free to explore! Include commands such as ls
or pwd
that helped you to figure out where everything was located.
Finally, you must update the README.md file that you imported. The imported README states that the files are messed up; your fixed up README should be rewritten and state something along the lines of the repository being a set of folders and files for movies that were fixed up as part of a command line exercise.
Online:
On hardcopy or email, answers to the following:
git clone
, that you used to carry out the required lab activities. Your command list should (1) include a few cd
commands and (2) demonstrate that you can use both absolute paths and relative paths./ ├── g1 │ └── m1 │ ├── a │ ├── b │ └── c └── g2 ├── m2 └── m3 └── d