Data Science Austria

Data Science Austria

The last few months I set out to build up to build a news and event aggregator. You can see the work in progress here: data-science-austria.at

Wordpress

Plugins

Here is a list of plugins that I use for the site grouped by the general overall purpose. The first one is a collection that I would generally recommend for any site, i.e. some ssl and security plugins.

The following are more specific and solve content problems.

RSS Feeds

The following RSS feeds are being imported.

Events Feed

Using the automatic import function of “The Events Calendar”-plugin, it reads all upcoming events from my meetup profile. Bigger conferences such as useR! are added by hand.

Machine Learning

I am also working on a feature that auto-tags all my content. The plan looks like this:

  1. Create a proper training set with hand-tagged articles.
  2. Use it to train a model with Azure text analysis service.
  3. Whenever a new article is imported, send it to the text analysis service via email (using a logic app).
  4. Write the tags back to Wordpress via REST-api.

git-ftp

A pretty cool way to work on a website and yet have all the features of version control is called git-ftp

If you use Git and you need to upload your files to an FTP server, Git-ftp can save you some time and bandwidth by uploading only those files that changed since the last upload.

It keeps track of the uploaded files by storing the commit id in a log file on the server. It uses Git to determine which local files have changed.

You can easily deploy another branch or go back in the Git history to upload an older version.

Here is a small extract of how to use it:

# Setup
git config git-ftp.url "ftp://ftp.example.net:21/public_html"
git config git-ftp.user "ftp-user"
git config git-ftp.password "secr3t"

# Upload all files
git ftp init

# Or if the files are already there
git ftp catchup

# Work and deploy
echo "new content" >> index.txt
git commit index.txt -m "Add new content"
git ftp push
# 1 file to sync:
# [1 of 1] Buffered for upload 'index.txt'.
# Uploading ...
# Last deployment changed to ded01b27e5c785fb251150805308d3d0f8117387.