Notes on “The First Room-Temperature Superconductor Has Finally Ben Found by sciencenews.org”

  • Title:: The First Room-Temperature Superconductor Has Finally Been Found
  • Author:: [[sciencenews.org]]
  • Recommended By:: [[Tyler Cowen]]
  • Reading Status:: #complete
  • Review Status:: #complete
  • Tags:: #articles #superconductor #technology #innovation #[[new technology]]
  • URL:: https://www.sciencenews.org/article/physics-first-room-temperature-superconductor-discovery
  • Source:: #instapaper
  • Roam Notes URL:: link
  • Anki Tag:: science_news_room_temp_superconductor
  • Anki Deck Link:: link
  • Notes

    • Scientists reported the discovery of the first room-temperature [[superconductor]], after more than a century of waiting. (View Highlight)
    • Superconductors transmit electricity without resistance, allowing current to flow without any energy loss. But all superconductors previously discovered must be cooled to very low temperatures, making them impractical. (View Highlight) #Ankified
    • If a room-temperature [[superconductor]] could be used at atmospheric pressure (the new material only works at very high pressure), it could save vast amounts of [[energy]] lost to resistance in the [[electrical grid]]. And it could improve current technologies, from [[MRI machines]] to [[quantum computers]] to [[magnetically levitated trains]]. Dias envisions that humanity could become a “superconducting society.” (View Highlight)
    • It’s a big advance, but practical applications still a long way off.

Roam Notes on “Patrick Collison in conversation with Tyler Cowen | Full Q&A | Oxford Union Web Series”

https://www.youtube.com/watch?v=wfdRF_krbp8
  • Author:: [[Tyler Cowen]] and [[Patrick Collison]]
  • Source:: link
  • Recommended By:: [[Tyler Cowen]]
  • Tags:: #technology #progress
  • Roam Notes URL:: link
  • {{[video]: https://www.youtube.com/watch?v=wfdRF_krbp8}}
  • (0:29) [[Tyler Cowen]]: As the next big biomedical technology breakthroughs come, are you concerned that increased life expectancy would result in calcification of institutions by entrenching incumbents?
    • [[Patrick Collison]]: It’s a problem to be solved, but not convincing because proposing the inverse "can we ensure everyone dies at age 80?" seems to clearly be "no".
  • (1:35) [[Tyler Cowen]]: To what extent do you think the attraction of progress is "feel / aesthetics" or giving people what they want?
    • PC: Correlation between happiness and GDP is about .78. So progress really does drive satisfaction. That suggests it’s more about the outcome rather than the process, but my intuition is that it’s the process of generating progress itself that is the relevant question.
  • 3:43 [[Tyler Cowen]]: You’ve written both optimistic and pessimistic visions for our path forward with technology. What is your underlying model?
  • (5:50) [[Tyler Cowen]]: The [[mRNA vaccines]] work, and there was at least 25 years where there was no marketplace adoption. All the sudden paradise rains down during [[COVID-19]] – maybe this is how progress works and we shouldn’t be so pessimistic?
    • [[Patrick Collison]]: You could argue the opposite – the fact that we needed a pandemic to finally get to commercialization is an indicator of systemic problems. The fact they were so ready to deploy, indicates the extent of the problem.
  • (7:43) [[Tyler Cowen]]: What is the most misleading statistic and what is the most underrated statistic for measuring progress?
    • [[Patrick Collison]]: Self-reported happiness is important but a lot of the comparisons you want to perform with it are fraught or misleading. Intertemporal comparisons lead to strange conclusions.
    • (10:15) [[Tyler Cowen]]: part of me thinks total [[population]] may be the ultimate measure of progress, which would not be good for [[Japan]]. Everyone admires small countries that are well run, but consider [[Brazil]] – obviously lots of problems, not as well run, but it’s produced many people.
  • (12:30) [[Patrick Collison]]: Culture is very important for determining progress. If you look at [[the Scottish Enlightenment]], they were very obsessed with things like [[culture]], [[norms]], and mindset, which seems old-fashioned now.
  • (13:30) [[Patrick Collison]]: [[Africa]] has a promising future because of the internet: the people there are suddenly able to compete there on the same level as other places in the world. They also have a significantly growing and young population.
    • [[Tyler Cowen]]: Another advantage – since there are many more countries there, they can run more experiments.
    • [[Patrick Collison]]: They understand the importance or progress better than many westerners, who tend to now have a complacent, postmodern view that it’s not that important.
  • (20:40) [[Patrick Collison]]: [[Ireland]] has a bit of an inferiority complex, so it doesn’t view itself as the best at everything, but this kind of attitude can help stoke progress.
  • (22:04) [[Patrick Collison]]: [[Mathematica]] is one of the most underrated achievements of our age. #programming
    • It’s been getting steadily better over multiple decades. Programming languages don’t innovate much after they’re released, partially because they’re [[open source software]] which can make it harder to make significant changes. Mathematica shows that a multi-decadal software project is totally sensible – it’s improving at a faster rate now than it ever has.
    • Mathematica is like [[Stripe]] in that they are both sort of programming languages – one for computing, one for financial infrastructure. Developer productivity is the primary focus for both. [[Stephen Wolfram]] is also admirable and ambitious. He doesn’t believe in libraries – he believes that your programming language should just do all the things! It’s like he’s building the Library of Alexandria in the programming language.
  • (26:00) [[Tyler Cowen]]: You showed an early interest in meta-programming languages such as [[Lisp]]. Why, and what does that show about your thought generally? #programming #[[functional programming]]
    • [[Patrick Collison]]: Two things:
      • In computing we’re stuck in these local maxima and there’s an entrenched status quo. The cost is probably much greater than people realize. Off the beaten path projects like Lisp and Mathematica helped to understand the design space and what was possible. #learning
      • Lisp is a programming language for individuals. It takes seriously the question "how do you make a single individual as enabled and productive as possible?" E.g. "reader macros" where you define on the fly the actual syntax of the language. To other programmers this is a disaster – how do you have a large project where you get a bunch of people to work on random syntax you defined? [[Stripe]] takes this individual view: how can we make it possible for one person to do build a business with financial payments in one evening?
  • (29:20) [[Tyler Cowen]]: Why is Stripe a [[writing]] company? And how does this spring from your love of [[Lisp]] and [[Mathematica]]?
    • [[Patrick Collison]]: If you take ideas seriously, you have to become a writing culture. You want to find the best solution, not something that "just works". We’re still debating fundamental questions at Stripe that have been around for years. To make progress on that, you have to be a writing culture. If you don’t write ideas down extensively or specifically, it’s hard to say that they’re wrong and you can’t make progress.
  • (42:16) [[Patrick Collison]] the prevalence of [[open office plans]] has a lot to do being able to shuffle around people easily in a high growth company. Three unique strategies of [[Stripe]] in terms of creating an optimal [[work environment]]:
    • Move teams quickly (every 3-6 months switch to a new location)
    • Move unrelated teams close together (for serendipity, creating a warm atmosphere)
    • Making the entire physical space as connected as possible (e.g. central stairwells to get as much of serendipitous interaction as possible).
  • (48:45) [[Patrick Collison]]: It’s actually hard to get funding at top universities with large endowments. A lot of the best [[Fast Grants]] applications were actually from people from top universities, so there are potentially high returns to improving funding for the best researchers. #[[research funding]]
  • (52:20) [[Tyler Cowen]] How should we better run funding institutions, and why is there so much [[conformism]] in universities / nonprofit / philanthropy, and how does all that tie together? #[[research funding]]
    • [[Patrick Collison]]: for science institutions, more structural diversity. In terms of which work is being funded, what the field delineations are, different models for how careers work or where work is done. Find all of the axes where you could try new and different things. A lot of people don’t realize how monochromatic it is – so much is downstream of institutions like [[NIH]] – researchers understand how stifling this is, but don’t speak out about it because they rely on the funding.

For access to my shared Anki deck and Roam Research notes knowledge base as well as regular updates on tips and ideas about spaced repetition and improving your learning productivity, join “Download Mark’s Brain”.

Roam Notes and Anki Deck on “Notes on Technology in the 2020s” by Eli Dourado

  • Title:: Notes on Technology in the 2020s
  • Author:: [[Eli Dourado]]
  • Recommended By:: [[Patrick Collison]] on Twitter https://twitter.com/patrickc/status/1345405936240742400
  • Reading Status:: #complete
  • Review Status:: #[[third pass]]
  • Tags:: #articles #technology #progress #innovation
  • URL:: https://elidourado.com/blog/notes-on-technology-2020s/
  • Source:: #instapaper
  • Anki Tag:: dourado_2020_tech
  • Anki Deck Link:: link
  • Notes

    • Overview
      • [[Eli Dourado]] thinks through how various promising technologies could evolve over the next decade. (View Highlight)
    • End of [[the great stagnation]]?
      • Metric marking the end of [[the great stagnation]] – sustained growth in utilization-adjusted [[total factor productivity]] of 2 percent per year. It was 2.1 percent over 1947-1972, only .17 percent since 2005. (Note: utilization-adjusted version is important since it corrects for the business cycle.) (View Highlight) #Ankified
      • Scientific breakthroughs alone are not enough to end the Great Stagnation. "TFP only budges when new technologies are adopted at scale, and generally this means products, not just science…This means building businesses, surmounting regulatory obstacles, and scaling production. (View Highlight) #commercialization #Ankified
    • [[Biotech]] and [[health]] (View Highlight)
      • [[mRNA vaccines]] provides the ability to encode and deploy arbitrary mRNA in our bodies—"it allows us to essentially program our cells to make whatever proteins we want". For [[COVID-19]], the vaccine instructs our cells to make the spike protein (View Highlight). mRNA technology can be deployed against non-viruses, like [[cancer]] (e.g. [[BioNTech]] treatment). (View Highlight) #[[Moderna]]
      • [[CRISPR]] is a technique for editing [[DNA]] discovered in 2012, but haven’t made a meaningful economic contribution yet—no treatment using CRISPR has been approved outside of [[clinical trials]]. (View Highlight) #Ankified
      • "[[DeepMind]] [[protein-folding]] breakthrough signals a promising decade for the science of [[proteomics]]. Most directly, being able to predict protein shapes will enable us to discover drugs more rapidly." But this is still a way off due to drug trials taking a long time. (View Highlight).
      • [[life extension]]: [[Conboy Lab at Berkeley]] helped prove that replacing plasma rejuvinates germ layer tissues and improves cognition by reducing neuroinflammation. (View Highlight) This is a product that could actually come to market – [[therapeutic plasma exchange]] is [[FDA]]-approved for other conditions (not aging), but could be provided off-label, and it’s cheap – "An automated [[plasmapheresis machine]]—which lets you do treatment after treatment—can be bought online for under $3,000". (View Highlight)
        • Another related product is [[aging clocks]] to know how biologically old you are – these are available today. (View Highlight)
        • [[metformin]] is something to look into if you are metabolically unhealthy. (View Highlight)
      • [[health sensors]] on [[wearables]] like Apple Watch are becoming better and more prevalent every year. (View Highlight)
      • "Let’s salute and cheer for the discoveries, but spare many thoughts for the entrepreneurs trying to bring treatments to market." (View Highlight) #commercialization
    • [[Energy]]
      • [[wind power]] and [[solar power]]: costs of these have decreased significantly over the 2010s but deployment is only 9% of utility-scale electricity generation in the US as of 2019. Going forward, cost reductions will stall, but deployment will increase. (View Highlight) #Ankified
        • Intermittency is a challenge. To reach a grid powered entirely by today’s renewables, we would need storage at a price of $20 per kWh (with caveats). To power the grid today entirely with renewables, would need price to be about $20 per kWh, while current prices are in the $500-$600 per kWh range. Increased demand could make price reductions in the future challenging. (View Highlight)
      • [[nuclear power]] or [[geothermal power]] seem to be required for scalable zero-carbon baseload energy.
        • [[nuclear power]] is challenging due to high costs
        • [[geothermal power]] is the most plausible this decade. This is apparently an area ripe for innovation: "The startups I have spoken to think with today’s technology they can crack 3.5¢/kWh without being confined to volcanic regions." Possibly 1¢/kWh by the 2050s, making it difficult for [[nuclear power]] to compete (View Highlight) #Ankified
      • [[sustainable alternative fuels (SAF)]] will be big in 2020s. Airlines can’t electrify since batteries can’t match fossil fuel energy density, which means airlines must go with [[hydrogen fuel]] or SAF. Dourado is betting on SAF over Hydrogen (esp. fuel made from CO2 from the atmosphere), since they are more energy dense. (View Highlight) #Ankified
    • [[transportation]]
      • [[electric cars]] – they’re better than regular cars due to lower fuel costs, lower maintenance costs (fewer moving parts), faster acceleration, higher low-end torque. (View Highlight) One exception is trucking, which may have to shift to hydrogen. This shift will significantly reduce air pollution from unregulated ultrafine particles; resulting in fewer premature birth, asthma, cancer, and mystery illness.
      • [[autonomous vehicles]] could happen at scale in 2020, and autonomy is inevitable eventually with constantly improving sensors and machine learning algorithms. (View Highlight)
      • [[supersonic aircraft]] will have a big impact on global business when it comes, but this is likely not in the 2020s. (View Highlight) [[urban air mobility]] may also happen (e.g. Joby, Wisk).
      • [[drone delivery]] is likely in the 2020s, with the [[FAA]] about to issue a rule expanding operations at night and flights over crowds. (View Highlight)
      • [[tunnels]] are a possible route in countries like the US where it is extremely difficult to build above-ground due to "promiscuous distribution of the veto power" (View Highlight). [[The Boring Company]] has a couple promising projects here, and Dourado is optimistic about the impact on commerce since time and hassle cost of travel is a key input to the [[gravity model of trade]].
    • [[space]]
      • [[SpaceX]] seems poised to dramatically reduce the cost of space exploration with [[Starship (SpaceX)]]. The Space Shuttle was about $65,000/kg to low earth orbit, [[Falcon 9 (SpaceX)]] is only $2,600/kg, and reasonable estimates suggest Starship could reach $10/kg. (View Highlight) #Ankified
      • Some consequences: commerce between Earth and space expands (e.g. manufacturing materials that can only be made in space, [[Starlink (SpaceX)]]), and less engineering required on payloads due to the consequences of losing them being lower. #[[gravity model of trade]] (View Highlight)
    • [[information technology]]
      • "[[custom silicon]] is going to be huge", due to incredible performance gains. Another name for this is [[system on a chip (SoC)]]. [[Apple M1]] is a notable example. "Almost all computer hardware—anything that has any scale to it—will move in this direction"
    • Conclusion
      • "It all depends on [[execution]]. The underlying science is there. The engineers are willing. Even the funding is available in most cases. But, as a society, how much urgency do we feel? Our culture does not prioritize [[progress]]—it fights, destructively, for [[status]]. And our politics reflects our culture." (View Highlight)

For access to my shared Anki deck and Roam Research notes knowledge base as well as regular updates on tips and ideas about spaced repetition and improving your learning productivity, join “Download Mark’s Brain”.

Notes on Balaji Srinivasan Interview: Technology Will Lead to a Borderless World

Overview: [[Balaji Srinivasan]] does a wide ranging interview with [[Nick Gillespie]] from [[Reason Magazine]] discussing his ideas around #Voice and #Exit, the relationship between technology and the logic of violence, his intellectual heroes, among other topics.

1:45 At least two responses you can make when encountering ossified systems that you don’t like:

  • [[Voice]]: Expressing dissatisfaction (e.g. democracy, revolution)
  • [[Exit]]: Recognizing you won’t be able to change the system, so you leave and start something new.

All progress takes this form: you build something up, and once you get to a certain scale it becomes ossified. At that point, people start to [[Exit]] to go build something better. #progress #innovation

4:35 Technology is reducing the barrier to [[Exit]]. Two ways it does this:

  • Cloud dimension: Because of the cloud, you can earn from anywhere, collaborate from anywhere. Eventually, [[Balaji Srinivasan]] believes this will be taken further using [[cryptocurrency]] and [[VR]].
  • Mobile: Geography becomes less important. Increasingly, you live in an apartment complex and wouldn’t recognize anyone, but you have hour long conversations with people thousands of miles away.

7:30 [[Bitcoin]] and other [[cryptocurrency]] lets you transfer funds without an intermediary. This is a huge win for small payments across borders, which would otherwise have not been feasible due to very high transaction costs.

9:00 [[Silicon Valley]] benefited from having a lot of people close together to create an innovative environment. Can you achieve that same effect virtually?

12:30 Talks about the book “The Sovereign Individual” #[[Book: The Sovereign Individual]]. It’s like the book of prophesies. It’s written in 1999 and many sentences seem like they’re ripped from last week’s headlines and many have yet to happen. Most books you can summarize into a sentence, but this book is the opposite. You take a sentence and you can expand it into a whole book. #[[To Read]]The book talks about how changes in [[technology]] change the logic of [[violence]]. What happens when you can’t see someone’s money? Good things and bad things. Good things: everything becomes voluntary and you can keep money you create. Bad things: it’s hard to track down robbers if they manage to steal some cash. So, more petty crimes. But fewer wars (government’s can’t seize funds).

19:45 Talks about his experience being considered by [[Donald Trump]] as a candidate for the head of the [[FDA]] and what he would have done if he was appointed.

21:30 When nations are on the rise, people are often more willing to take substantial risks to make [[progress]]. If you look at the [[history]] of [[aviation]], [[automobiles]], [[railroads]] a lot of people died. History of [[chemistry]] – the CRC handbook of chemistry and physics from a long time ago has a lot of tastes and smells for new compounds, because people would actually taste and smell new compounds! Wouldn’t want to be the guy that did this for cyanide.

23:45 [[Balaji Srinivasan]]’s intellectual and business heroes. He calls himself a pragmatist and technologist. Avoids political labels, because that can maximize your coalition to build something new.

  • [[Srinivasa Ramanujan]] (mathematician) is one of his intellectual heroes. He would like to build things to identify talent like that, giving them opportunity to rise.
  • [[Lee Kuan Yew]] – a founding father of [[Singapore]]. He wrote a lot and gave many interviews. He had a great book “From Third World To First” #[[To Read]]. His work, and [[Singapore]] generally, defies traditional “left” and “right” political boundaries. This influenced his thought on finding pragmatic best outcomes and not just going down partisan routes. #politics #ideology

30:00 Advice for his young son in this new world: “probably want him to go Satoshi”. You can’t discriminate against Satoshi – competes completely on the basis of ideas. Also, mobs can’t take him down and ruin his reputation by attacking other aspects of their life. #[[Satoshi Nakamoto]] #pseudonymity #Parenting

Digging into Data Science Tools: Docker

Docker is a tool for creating and managing “containers” which are like little virtual machines where you can run your code. A Docker container is like a little Linux OS, preinstalled with everything you need to run your web app, machine learning model, script, or any other code you write.

Docker containers are like a really lightweight version of virtual machines. They use way less computer resources than a virtual machine, and can spin up in seconds rather than minutes. (The reason for this performance improvement is Docker containers share the kernel of the host machine, whereas virtual machines run a separate OS with a separate kernel for every virtual machine.)

Aly Sivji provides a great comparison of Docker containers to shipping containers. Shipping containers improved efficiency of logistics by standardizing the design: they all operate the same way and we have standardized infrastructure for dealing with them, and as a result you can ship them regardless of transportation type (truck, train, or boat) and logistics company (all are aware of shipping containers and mold to their standards). In a similar way, Docker provides a standardized software container which you can pass into different environments and be confident they’ll run as you expect.  

Brief Overview of How Docker Works

To give you a really high-level overview of how Docker works, first let’s define three big Docker-related terms – “Dockerfile”, “Image”, and “Container”:

  • Dockerfile: A text file you write to build the Docker “image” that you need (see definition of image below). You can think of the Dockerfile like a wrapper around the Linux command line: the commands that you would use to set up a Linux system on the command line have equivalents which you can place in a docker file. “Building” the Dockerfile produces an image that represents a Linux machine that’s in the exact state that you need. You can learn all about the ins-and-outs of the syntax and commands at the Dockerfile reference page. To get an idea of what Dockerfiles look like, here is a Dockerfile you would use to create an image that has the Ubuntu 15.04 Linux distribution, copy all the files from your application to ./app in the image, run the make command on /app within your image’s Linux command line, and then finally run the python file defined in /app/app.py:
FROM ubuntu:15.04
COPY . /app
RUN make /app
CMD python /app/app.py
  • Image: A “snapshot” of the environment that you want the containers to run. The images include all you need to run your code, such as code dependencies (e.g. python venv or conda environment) and system dependencies (e.g. server, database). You “build” images from Dockerfiles which define everything the image should include. You then use these images to create containers.
  • Container: An “instance” of the image, similar to how objects are instances of classes in object oriented programming. You create (or “run” using Docker language) containers from images. You can think of containers as a running the “virtual machine” defined by your image.

To sum up these three main concepts: you write a Dockerfile to “build” the image that you need, which represents the snapshot of your system at a point in time. From this image, you can then “run” one or more containers with that image.

Here are a few other useful terms to know:

  • Volume: “Shared folders” that lets a docker container see the folder on your host machine (very useful for development, so your container is automatically updated with your code changes). Volumes also allow one docker container to see data in another container. Volumes can be “persistent” (the volume continues to exist after the container is stopped) or “ephemeral” (the volume disappears as soon as the container is stopped).
  • Container Orchestration: When you first start using Docker, you’ll probably just spin up one container at a time. However, you’ll soon find that you want to have multiple containers, each running using a different image with different configurations. For example, a common use of Docker is deployment of applications as “microservices”, where each Docker container represents an individual microservice that interacts with your other microservices to deliver your application. Since it can get very unwieldy to manage multiple containers manually, there are “container orchestration” tools that automate tasks such as starting up all your containers, automatically restarting failing containers, connecting containers together so they can see each other, and distributing containers across multiple computers. Examples of tools in this space include docker-compose and Kubernetes.
  • Docker Daemon / Docker Client: The Docker Daemon must be running on the machine where you want to run containers (could be on your local or remote machine). The Docker Client is front-end command line interface to interact with Docker, connect to the Docker Daemon, and tell it what to do. It’s through the Docker client where you run commands to build images from Dockerfiles, create containers from images, and do other Docker-related tasks.

Why is Docker useful to Data Scientists?

You might be thinking “Oh god, another tool for me to learn on top of the millions of other things I have to keep on top of? Is it worth my time to learn it? Will this technology even exist in a couple years?

I think the answer is, yes, this is definitely a worthwhile tool for you to add to your data science toolbox.

To help illustrate, here is a list of reasons for using Docker as a data scientist, many of which are discussed in Michael D’agostino’s “Docker for Data Scientists” talk as well as this Lynda course from Arthur Ulfeldt:

  • Creating 100% Reproducible Data Analysis: Reproducibility is increasingly recognized as critical for both methodological and legal reasons. When you’re doing analysis, you want others to be able to verify your work. Jupyter notebooks and Python virtual environments are a big help, but you’re out of luck if you have critical system dependencies. Docker ensures you’re running your code in exactly the same way every time, with the same OS and system libraries.
  • Documentation: As mentioned above, the basis for building docker containers is a “Dockerfile”, which is a line by line description of all the stuff that needs to exist in your image / container. Reading this file gives you (and anyone else that needs to deploy your code) a great understanding about what exactly is running on the container.
  • Isolation: Using Docker helps ensure that your tools don’t conflict with one another. By running them in separate containers, you’ll know that you can run Python 2, Python 3, and R and these pieces of software will not interfere with each other.
  • Gain DevOps powers: in the words of Michaelangelo D’Agostino, “Docker Democratizes DevOps”, since it opens up opportunities to people that used to only available to systems / DevOps experts:
    • Docker allows you to more easily “sidestep” DevOps / system administration if you aren’t interested, since someone can create a container for you and all you have to do it run it. Similarly, if you like working with Docker,  you can create a container less technically savvy coworkers that lets them run things easily in the environment they need.
    • Docker provides the ability to build docker containers starting from existing containers. You can find many of these on DockerHub, which holds thousands of pre-built Dockerfiles and images. So if you’re running a well-known application (or even obscure applications), there is often a Dockerfile already available that can give you a tremendous running start to deploy your project. This includes “official” Docker repositories for many tools, such as ubuntu, postgres, nginx, wordpress, python, and much more.
    • Using Docker helps you work with your IT / DevOps colleagues, since you can do your Data Science work in a container, and simply pass it over to DevOps as a black box that they can run without having to know everything about your model.

Here are a few examples of applications relevant to data science where you might try out with Docker:

  • Create an ultra-portable, custom development workflow: Build a personal development environment in a Dockerfile, so you can access your workflow immediately on any machine with Docker installed. Simply load up the image wherever you are, on whatever machine you’re on, and your entire work environment is there: everything you need to do your job, and how you want to do your job.
  • Create development, testing, staging, and production environments: Rest assured that your code will run as you expect and become able to create staging environments identical to production so you know when you push to production, you’re going to be OK.
  • Reproduce your Jupyter notebook on any machine: Create a container that runs everything you need for your Jupyter Notebook data analysis, so you can pass it along to other researchers / colleagues and know that it will run on their machine. As great as Jupyter Notebooks are for doing analysis, they tend to suffer from the “it works on my machine” issue, and Docker can solve this issue.

For more inspiration, check out Civis Analytics Michaelangelo D’Agostino describe the Docker containers they use (start at the 18:08 mark). This includes containers specialized for survey processing, R shiny apps and other dashboards, Bayesian time series modeling and poll aggregation, as well as general purpose R/Python packages that have all the common packages needed for staff.

Further Resources

If you’re serious about starting to use Docker, I highly recommend the Lynda Course Learning Docker by Arthur Ulfeldt as a starting point. It’s well-explained and concise (only about 3 hours of video in total). I created a set of Anki flashcards from this course you can access here. I also recommend the book Docker Deep Dive by Nigel Poulton. I also created Anki flashcards from this book that you can access here.

Here are a few other useful resources you might want to check out: