Skip to main content

keywords extract

Project description

keywords extract


英文关键词抽取 支持方法:

  • tfIdf
  • yake
  • topicRank
  • textRank
  • keyBert

Install

pip install keywords_en

或者

pip install -r requirements_dev.txt
pip install -e git+https://github.com/smirnov-am/pytopicrank.git#egg=pytopicrank
pip install git+https://github.com/LIAAD/yake


python setup.py install

示例

from keywords_en import KeyWordsEn
print(dir(KeyWordsEn))


text = 'The Free Software Foundation (FSF) is a nonprofit with a worldwide mission to promote computer user freedom. Escape to Freedom now also available in Mandarin and Spanish "Escape to Freedom" is a new animated video from the Free Software Foundation (FSF), giving an introduction to the concepts behind software freedom: both what we gain by having it, and what rights are at stake. We now have the video available in Mandarin and Spanish language tracks. 2022 Bulletin: "Unjust Algorithms" by Zoë Kooyman Developments in artificial intelligence (AI) injustices have rapidly taken a turn for the worse in recent years. Algorithmic decision-making systems are used more than ever by organizations, educational institutions, and governments looking for ways to increase understanding and make predictions. The Free Software Foundation (FSF) is working through this issue, and its many scenarios, to be able to say useful things about how this relates to software freedom. Our call for papers on Copilot was a first step in this direction. Free software means that the users have the freedom to run, edit, contribute to, and share the software. Thus, free software is a matter of liberty, not price. We have been defending the rights of all software users for the past 35 years. Help sustain us for many more; become an associate member today. Subscribe to our monthly newsletter, the Free Software Supporter: Our initiatives Defective by Design is a grassroots campaign to eliminate Digital Restrictions Management (DRM) in media and devices. Read the Email Self-Defense Guide to get started with email encryption, a skill necessary to combat bulk surveillance. The End Software Patents initiative fights to abolish software patents around the world. Join us in calling for a Web that respects our freedom by being compatible with free software and stand up against nonfree JavaScript. The Free Software Directory is a collaborative catalog of computer programs and apps that are fully free. The GNU operating system is a continuously evolving, complete operating system made entirely of free software. LibrePlanet is our global network of free software activism, including events like our annual conference, and online collaboration spaces. The Licensing and Compliance Lab is the preeminent resource for public education on licensing best practices and enforcing the GPL. The "Respects Your Freedom" program certifies retailers who sell hardware in a manner that respects the rights of their users. "This community that we have, that were building, that does so much, has to grow. We cant compete with Apple, we cant compete with Google, directly, in the field of resources. What we can eventually do is head count and heart count. We can compete on the ground of ideology because ours is better." -- Edward Snowden, NSA whisteblower, speaking at LibrePlanet 2016. Take Action Bulletin Old but not forgotten Unjust Algorithms Verifying free software: The basics The need for free software education now Overcoming the hurdle of "industry standard" in education technology Read the current issue of the Bulletin and check out the archives. Shop Beat the summer heat with a stylish GNU baseball cap!'

algo_name = "tfIdf"
kw = KeyWordsEn(algo_name=algo_name, text=text)
print(kw.keywords)


algo_name = "yake"
kw = KeyWordsEn(algo_name=algo_name, text=text)
print(kw.keywords)

algo_name = "topicRank"
kw = KeyWordsEn(algo_name=algo_name, text=text)
print(kw.keywords)

algo_name = "textRank"
kw = KeyWordsEn(algo_name=algo_name, text=text)
print(kw.keywords)

algo_name = "keyBert"
kw = KeyWordsEn(algo_name=algo_name, text=text)
print(kw.keywords)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

keywords_en-0.1.3.tar.gz (14.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

keywords_en-0.1.3-py2.py3-none-any.whl (18.2 kB view details)

Uploaded Python 2Python 3

File details

Details for the file keywords_en-0.1.3.tar.gz.

File metadata

  • Download URL: keywords_en-0.1.3.tar.gz
  • Upload date:
  • Size: 14.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.3 readme-renderer/34.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.11 tqdm/4.64.0 importlib-metadata/4.8.3 keyring/23.4.1 rfc3986/1.5.0 colorama/0.4.5 CPython/3.6.8

File hashes

Hashes for keywords_en-0.1.3.tar.gz
Algorithm Hash digest
SHA256 50836d5a44a4e927713a04665f44ac506f448f150661867efd6c3e3d2284c28e
MD5 a878fef42e6014a2a45456c9e3f02328
BLAKE2b-256 c8ea4493168fe0781c80001f77c3635015c5465c87763fab673e9ddfb3b1c7b0

See more details on using hashes here.

File details

Details for the file keywords_en-0.1.3-py2.py3-none-any.whl.

File metadata

  • Download URL: keywords_en-0.1.3-py2.py3-none-any.whl
  • Upload date:
  • Size: 18.2 kB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.3 readme-renderer/34.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.11 tqdm/4.64.0 importlib-metadata/4.8.3 keyring/23.4.1 rfc3986/1.5.0 colorama/0.4.5 CPython/3.6.8

File hashes

Hashes for keywords_en-0.1.3-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 2c8fbaf8671f1ad44e702132c97316b17447eeea5ebcef09e7dcc3632c596e60
MD5 939780f3a301e81990cc06c24b2f7443
BLAKE2b-256 9d88c7983ad2721d8d75c59894812e8c412d8458a3de9e8aa5513916e49ccfc3

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page