
Scraping the vocabulary from the Memrise course

Features:

  • Scrape the vocabulary from Memrise courses

Application Requirements

Install DB Browser for SQLite

Install Library:

Windows

python -m pip install memrise

Linux

pip install memrise

macOS

sudo pip3 install memrise

Guidelines

How to get the Course ID?

Open the course on the Memrise website and copy the Course ID, as shown in the picture.
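If you prefer not to copy the ID by hand, it can be pulled out of the course URL. This is a minimal sketch that assumes course URLs have the form https://app.memrise.com/course/<digits>/<slug>/; that URL shape is an assumption and is not stated in this README. The helper name extract_course_id is hypothetical and is not part of the memrise package.

```python
import re
from typing import Optional

# Assumption: the numeric Course ID follows "/course/" in the URL.
_COURSE_ID_RE = re.compile(r"/course/(\d+)(?:/|$)")


def extract_course_id(url: str) -> Optional[int]:
    """Return the numeric Course ID found in a course URL, or None if absent."""
    match = _COURSE_ID_RE.search(url)
    return int(match.group(1)) if match else None
```

The function returns None rather than raising, so a caller can fall back to asking for the ID manually.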

Import library and initialize database

from memrise import Course, Data
# Create the output database file (the .db extension also works)
db = Data('English.sqlite')
# Connect to the database file and initialize it
db.init_database()
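To confirm that the database file was initialized, you can list its tables with Python's standard sqlite3 module. This is a sketch that does not depend on the memrise package; the helper name list_tables is hypothetical, and the table names you see depend on what init_database() actually creates.

```python
import sqlite3
from contextlib import closing
from typing import List


def list_tables(db_path: str) -> List[str]:
    """Return the names of all tables in the SQLite file, sorted by name."""
    # Note: sqlite3.connect() creates an empty file if db_path does not exist.
    with closing(sqlite3.connect(db_path)) as conn:
        rows = conn.execute(
            "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
        ).fetchall()
    return [name for (name,) in rows]
```

For example, list_tables('English.sqlite') should return a non-empty list once the database has been initialized.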

Scraping course with ID

The Course class takes two parameters:

  • CourseID: the Course ID obtained above
  • LanguageID: the ID of the language that the course teaches

The LanguageID is defined in the following list. Note the ID of your course's language for the next step.

Language IDs:
    1. English UK
    2. English US
    3. Chinese
    4. Japanese
    5. French
    6. Spanish Mexico
    7. Italian
    8. German
    9. Russian
    10. Dutch
    11. Korean
    12. Arabic
    13. Spanish Spain
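The list above can be kept in code as a lookup table so that you do not need to remember the numbers. This is only an illustrative sketch: the names LANGUAGE_IDS and language_id are hypothetical and are not provided by the memrise package.

```python
# Language IDs copied from the list above.
LANGUAGE_IDS = {
    "English UK": 1,
    "English US": 2,
    "Chinese": 3,
    "Japanese": 4,
    "French": 5,
    "Spanish Mexico": 6,
    "Italian": 7,
    "German": 8,
    "Russian": 9,
    "Dutch": 10,
    "Korean": 11,
    "Arabic": 12,
    "Spanish Spain": 13,
}


def language_id(name: str) -> int:
    """Look up a Language ID by name, ignoring case and surrounding spaces."""
    wanted = name.strip().lower()
    for label, lang_id in LANGUAGE_IDS.items():
        if label.lower() == wanted:
            return lang_id
    raise KeyError(f"Unknown language: {name!r}")
```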

The following example scrapes an English course for Vietnamese speakers, using the English US IPA, so the Language ID is 2.

# Connect to the course and scrape its information; this may take a few moments.
course = Course(1658724, 2)
# Update the information about the course
db.update_course(course)

Update the course with meanings in your language

Use the update_db_en() method if the course language is English; it also scrapes the IPA.
Use the update_db() method if the course teaches any other language.
Both methods take the same parameters:

  • CourseID: the ID of the course
  • Language: your native language as a code such as 'en', 'fr', 'ko', 'vi'

# This course is English, so use update_db_en(); for other languages use update_db().
db.update_db_en(1658724, 'fr')
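The choice between the two methods can be made from the Language ID. The sketch below assumes that both English variants in the Language ID list (1 and 2) count as English courses; that reading, and the helper name update_meanings, are assumptions and not part of the memrise package. It only calls the update_db_en() and update_db() methods described above.

```python
# Assumption: Language IDs 1 (English UK) and 2 (English US) are the English courses.
ENGLISH_LANGUAGE_IDS = {1, 2}


def update_meanings(db, course_id: int, language_id: int, native_language: str) -> str:
    """Call update_db_en() for English courses and update_db() otherwise.

    Returns the name of the method that was called.
    """
    if language_id in ENGLISH_LANGUAGE_IDS:
        method_name = "update_db_en"
    else:
        method_name = "update_db"
    getattr(db, method_name)(course_id, native_language)
    return method_name
```

With the objects from the earlier steps, a call would look like update_meanings(db, 1658724, 2, 'vi').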

Check the output with SQLite

File output

To show the words table, follow these steps: Browse Data > Table > words

If you want the raw meaning, you can run the following SQL statement.

SELECT word, sub, IPA FROM words

Steps: Execute SQL > type the SQL statement > Run
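The same query can be run from Python instead of DB Browser, using the standard sqlite3 module. This is a sketch that relies only on the words table and the word, sub, and IPA columns named in the SQL statement above; the helper name fetch_words and the limit parameter are hypothetical additions.

```python
import sqlite3
from contextlib import closing
from typing import List, Tuple


def fetch_words(db_path: str, limit: int = 10) -> List[Tuple]:
    """Return up to `limit` (word, sub, IPA) rows from the words table."""
    with closing(sqlite3.connect(db_path)) as conn:
        return conn.execute(
            "SELECT word, sub, IPA FROM words LIMIT ?", (limit,)
        ).fetchall()
```

For example, fetch_words('English.sqlite', limit=5) returns the first five rows as tuples.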

Change log:

v1.0.0: Implemented vocabulary scraping
v1.1.0: Updated the IPA function
v1.2.1: Fixed an IPA error and added a new translate function

