Gebruiker:Andre Engels/Slide 4

Main library files bewerken

Wikipedia.py bewerken

Page and Site classes

Page: represents a page

  • Loading
  • Saving
  • Analyzing
    • Existence
    • Links
    • Categories
    • Interwiki
    • isRedirect
    • isDisambiguation
  • Moving/deleting
  • Get references

Other functions include:

  • uploading multiple pages
  • throttling

Family.py bewerken

  • URLs
  • Namespace names

Family file bewerken

Defined per site

  • hostname
  • directory structure
  • existing languages
  • policy information

Config.py bewerken

  • how to output (screen, logs,...)
  • throttling times
  • ... and more

User_config.py bewerken

Defined by the user

  • Sites with logins
  • Login names
  • Overwrite config.py

Other library files bewerken

  • pagegenerators.py: 'generators' of pages
  • xmlreader.py: interpret special:Export
  • mediawiki_messages.py: reading, interpreting and storing Special:Allmessages
  • login.py: logging in (cookies are kept)
  • terminal_interface.py: output to screen
  • catlib.py: Category class
  • date.py: title of pages for years, dates, centuries etc.
  • sqldump.py: read an SQL dump

Gebruiker:Andre Engels/Slide 5