Exercises

  • Combine extract_author_names.py and top_authors.py into a single script so that the intermediate files aren't needed.
  • Try to salvage some of the vote entries that were discarded by the scripts above.
  • Display the names of authors who got at least 10 votes in 2021 but fewer than 5 votes in 2019.
    • You'll have to fuzzy match the author names, since the winning spelling of a name could differ between the two lists.
  • Find the top 5 authors who had at least 5 votes in both lists and had the biggest gain in 2021 compared to 2019. You can decide how to calculate the gain: by vote count or by percentage increase.
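
As a starting hint for the last two exercises, here is a minimal sketch of fuzzy matching names across two lists and computing the gain both ways. It uses `difflib` from the standard library, which may not be the matching approach used earlier in this chapter. The vote counts, the `match_name` and `gains` helpers, and the `0.85` cutoff are all made up for illustration, so treat this as one possible approach rather than a reference solution.

```python
from difflib import get_close_matches

# Hypothetical vote counts, invented purely for illustration.
votes_2019 = {"brandon sanderson": 8, "robin hobb": 6, "n k jemisin": 5}
votes_2021 = {"brandon sandersen": 12, "robin hobb": 9, "nk jemisin": 11}

def match_name(name, candidates, cutoff=0.85):
    """Return the candidate closest to name, or None if none is similar enough."""
    found = get_close_matches(name, candidates, n=1, cutoff=cutoff)
    return found[0] if found else None

def gains(old, new, min_votes=5):
    """Yield (name, vote gain, percentage gain) for authors found in both lists."""
    for name, new_count in new.items():
        old_name = match_name(name, list(old))
        if old_name is None:
            continue  # no sufficiently similar spelling in the older list
        old_count = old[old_name]
        if old_count >= min_votes and new_count >= min_votes:
            diff = new_count - old_count
            yield name, diff, 100 * diff / old_count

# Rank by vote gain; change the key to t[2] to rank by percentage instead.
ranked = sorted(gains(votes_2019, votes_2021), key=lambda t: t[1], reverse=True)
for name, diff, pct in ranked[:5]:
    print(f"{name}: {diff:+d} votes ({pct:.0f}%)")
```

The cutoff controls how strict the matching is: a value that is too low will merge different authors, while one that is too high will miss spelling variants, so it is worth checking a few matches by hand. Ranking by vote count and ranking by percentage can give different orders, since a small 2019 count inflates the percentage.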

Further Reading