SBATCH, Dialog Parsing, & Word Clouds - Proper Nouns


15 Minutes


We just proved out our alcott_dialog_parsing project with a number of srun commands. We also gained additional practice creating SBATCH script files and submitting them using sbatch.

Let’s use what we’ve learned and revisit our analysis - this time extracting and visualizing proper nouns used by each of the characters.


Instructions:


  • Work independently in the main room, posting any questions that arise to slack.
  • Recommendations for writing your own code:
    • Read function documentation
    • Test out ideas - it’s okay to make mistakes and generate errors
    • Use a search engine to look up errors or recommended solutions using keywords
  • We’ll review possible solutions after time is up as a group.


  • Create an SBATCH file that uses singularity exec to run our dialog_parser.py script, this time extracting proper nouns from each character.
    • Pay attention to SBATCH preamble (job name, resources, log file location).
  • Submit the above SBATCH file, view the output, and verify that we have word lists containing proper nouns in our results directory.
  • Create an SBATCH file that creates a word cloud from one of the characters’ extracted proper nouns.
  • Submit that SBATCH file, view the output exists, and view the image in the Great Lakes file browser as well.
  • Create similar SBATCH files for all other characters and submit them.
  • View all of the result images in the Great Lakes file browser.


Link to Great Lakes Dashboard