一些与Twitter Cleaning有关的帖子

处理与Twitter相关数据的手段,持续更新ing

当利用Twitter API下载数据报错时

https://developer.twitter.com/en/support/twitter-api/error-troubleshooting

一些主要错误:

404 V2

  • The URI requested is invalid or the resource requested, such as a user, does not exist.
  • Check that you are using valid parameters and the correct URI for the endpoint you’re using.

406 V2

  • Returned when an invalid format is specified in the request.
  • Generally, this occurs where your client fails to properly include the headers to accept gzip encoding, but can occur in other circumstances as well.

429 V2(最令人作呕的报错)

  • Too Many Requests
  • Returned when a request cannot be served due to the App’s rate limit having been exhausted for the resource. See Rate Limiting.
  • 对于429错误要万分小心,一定记得提前写好排雷代码

有关其他error的原因的解释:https://developer.twitter.com/en/support/twitter-api/error-troubleshooting

利用正则表达式删除tweet中的url

https://*.com/questions/24399820/expression-to-remove-url-links-from-twitter-tweet

利用正则表达式清洗Tweet文本

https://github.com/ziishaned/learn-regex/blob/master/translations/README-cn.md

  • 去掉RT、@user_name、url和emoji
上一篇:tortoisegit 还原到某个版本


下一篇:Redis学习