I was on Github looking at something else (as always) when I saw someone I follow forking an R repository called Polite. Dmytry’s markdown to explain the pillars of politeness are well written and absolutely applicable to the wider corners of scraping that a lot of analysts do using tools like Alteryx.
Author: fragiledata
The Ethics of Overtime in Technology
Recently, I took on the mantle of teaching data ethics and security to colleagues who join my company. The training runs over a whole day, but I still feel there’s so much I can’t cover. Making some tea this morning, I realised one of the really important ones was an item that comes up at work under the radar a lot, which is the ethics of overtime.
NYPD Green? Visualising Police Spend in Selected International Cities
One of the clearer roots of the problem is the wretched state of American policing, which has been in dire need of reform for longer than I’ve been alive; a rallying call of the 2020 movement has been “Defund the Police”. I’ve not looked into police spending to know how much is too much, so I decided to do a comparative analysis of some American and international cities to see what their police budgets looked like.
Should I Scrape This? Planning Notes on Inspecting Websites
Webscraping can vary between wildly hard (purposefully or otherwise) and being the notch below an open API to extract data. However, there are often some hints you can use to check whether or not a particular site will be one of the easy or hard ones.
2019-nCoV: Spread In Chinese Provinces
I actually don’t spend that much time working with Tableau, so in the last year I’m a little out of practice. I went back to my training to restart my practice; do something you’re passionate about, and draw it out first.
AWS Summit, London 2019: How does Amazon Leadership drive Innovation?
On Wednesday my esteemed colleague @liluns and I went to the AWS Summit in London, which is several thousand people crammed into one end of the Excel Centre in the Docklands.
Translating Tidy Data Tip: Key Transformation Functions in Alteryx, R and Python
There’s a lot of data where one observation to a human (e.g. one survey) isn’t ideally one observation to the query language of a database system.
March Reading Club: Stories > Data
Sneaking in just before April does, I’ve decided to come out swinging with an frankly uncontroversial idea which may still split my colleagues, but only one this month.
February Reading Club: Bonus for people who don’t like reading
This week we’re going to be listening and reading, rather than completely reading! We’re feeling pretty seasonal with an item about, uh “interpersonal” topics, in time for February, and also for mental health, as February hosts the annual Time to Talk day.
Fragile’s Reading Club: January
This year I’m going to recommend two books every month which have help shape my adventures in data – we’re visiting probability and feedback in January.