Testing my program (Python)

User Generated

z6nn6

Computer Science

Description

Hello!

I have a program and I need to make sure that it works, because I'm not good with libraries. These are the requirements: Screen Scraper (2).docx

And this is the program: scapper (1).zip

Also, if you can, please fix it and teach me how to make it work.


Thank you.

Unformatted Attachment Preview

ITEC 423: A Simple Screen Scraper

In this exercise we will create a simple screen scraper in Python that extracts links from a given website to a depth of two (this can be modified later if desired), starting from an initial page. Links will be extracted in breadth-first fashion. The scraper will use the Beautiful Soup API, which helps separate the HTML code from the page content and works well for parsing pages with broken HTML. See Blackboard for more information on downloading Beautiful Soup and on working with Python. I will give a few demonstration examples in class and will distribute sample code.

For this assignment we will modify the original “searchengine.py” file to include a function that stores all of the extracted links in a file. We will then read each link, visit the page, and extract only the content from each page (stripping out all HTML tags). You will need to create a function (or modify the existing calc() function) to include a word frequency component. The word frequencies for each page should be printed to the screen and written to a file. If you have extra time, feel free to add other components and features to your program.

Turn in:
1. Your complete Python code (you do not need to submit the code for Beautiful Soup)
2. The output file where the URLs were stored
3. The output from the program with the results of the word frequency counter
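The attached scapper (1).zip and the original searchengine.py are not shown here, so the following is only a rough sketch of the steps the assignment describes, not a fix of that code. It makes several assumptions: it uses Python's standard-library html.parser in place of Beautiful Soup so that it runs without extra installs; "depth of two" is read as "the start page plus pages up to two links away"; and every name in it (parse_page, crawl, word_frequencies, save_results, fetch_url, urls.txt, frequencies.txt) is made up for illustration.

```python
import re
from collections import Counter, deque
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin
from urllib.request import urlopen


class _PageParser(HTMLParser):
    """Collects href values and visible text from one HTML page."""

    def __init__(self):
        super().__init__()
        self.links = []
        self.text_parts = []
        self._skip = 0  # greater than 0 while inside <script> or <style>

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip += 1
        elif tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip:
            self._skip -= 1

    def handle_data(self, data):
        if not self._skip:
            self.text_parts.append(data)


def parse_page(html, base_url):
    """Return (absolute http(s) links, tag-free text) for one page."""
    parser = _PageParser()
    parser.feed(html)
    parser.close()
    links = [urldefrag(urljoin(base_url, h)).url for h in parser.links]
    links = [u for u in links if u.startswith(("http://", "https://"))]
    text = " ".join(" ".join(parser.text_parts).split())
    return links, text


def word_frequencies(text):
    """Count lowercase alphabetic words in a page's text."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def crawl(start_url, fetch, max_depth=2):
    """Breadth-first crawl; returns {url: text} in the order pages were visited."""
    seen = {start_url}
    queue = deque([(start_url, 0)])  # FIFO queue gives breadth-first order
    pages = {}
    while queue:
        url, depth = queue.popleft()
        try:
            html = fetch(url)
        except Exception:
            continue  # skip pages that cannot be downloaded
        links, text = parse_page(html, url)
        pages[url] = text
        if depth < max_depth:
            for link in links:
                if link not in seen:
                    seen.add(link)
                    queue.append((link, depth + 1))
    return pages


def fetch_url(url):
    """Download a page over HTTP and return it as text."""
    with urlopen(url, timeout=10) as response:
        return response.read().decode("utf-8", errors="replace")


def save_results(pages, urls_path="urls.txt", freq_path="frequencies.txt"):
    """Write visited URLs to one file; print and write word counts to another."""
    with open(urls_path, "w", encoding="utf-8") as f:
        f.write("\n".join(pages) + "\n")
    with open(freq_path, "w", encoding="utf-8") as f:
        for url, text in pages.items():
            print(url)
            f.write(url + "\n")
            for word, count in word_frequencies(text).most_common():
                line = f"  {word}: {count}"
                print(line)
                f.write(line + "\n")
```

To try it on a real site, one would call crawl with fetch_url as the fetch argument and pass the result to save_results. Passing the download function in as a parameter also means the crawl logic can be checked offline with a dictionary of fake pages. If Beautiful Soup is required, parse_page is the only function that would need to change.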

This question has not been answered.
