BC MACHINE LEARNING


Roger Grosse
@RogerGrosse 7 years, 7 months ago 3767 views


Important paper from Google on large batch optimization. They do impressively careful experiments measuring # iterations needed to achieve target validation error at various batch sizes. The main "surprise" is the lack of surprises. [thread]

https://t.co/7QIx5CFdfJ

The paper is a good example of lots of elements of good experimental design. They validate their metric by showing lots of variants give consistent results. They tune hyperparameters separately for each condition, check that the optimum isn't at the endpoints, and measure sensitivity.
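The endpoint check can be sketched in a few lines of Python. This is only an illustration: the function name and the error values are made up and do not come from the paper.

```python
def optimum_is_interior(errors):
    """Return True when the lowest error is not at either end of the sweep."""
    best_index = min(range(len(errors)), key=errors.__getitem__)
    return 0 < best_index < len(errors) - 1

# Hypothetical validation errors for a sweep over four learning rates.
learning_rates = [0.001, 0.01, 0.1, 1.0]
errors_interior = [0.30, 0.12, 0.15, 0.90]  # best value is inside the range
errors_at_edge = [0.30, 0.20, 0.15, 0.10]   # best value is at the largest rate

print(optimum_is_interior(errors_interior))
print(optimum_is_interior(errors_at_edge))
```

When the check fails, the sweep range was probably too narrow, and the true optimum may lie outside it.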

They have separate experiments where they hold fixed # iterations and # epochs, which (as they explain) measure very different things. They avoid confounds, such as batch norm's artificial dependence between batch size and regularization strength.

When the experiments are done carefully enough, the results are remarkably consistent between different datasets and architectures. Qualitatively, MNIST behaves just like ImageNet.

Importantly, they don't find any evidence for a "sharp/flat optima" effect whereby better optimization leads to worse final results. They have a good discussion of experimental artifacts/confounds in past papers where such effects were reported.

The time-to-target-validation is explained purely by optimization considerations. There's a regime where variance dominates, and you get linear speedups w/ batch size. Then there's a regime where curvature dominates and larger batches don't help. As theory would predict.
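The two regimes can be reproduced in a toy model. The sketch below is not the paper's experiment: it assumes SGD on a two-dimensional quadratic, with gradient-noise variance in each dimension equal to that dimension's curvature divided by the batch size, and it tracks the expected loss exactly instead of sampling. The curvatures, target, and learning-rate grid are arbitrary choices.

```python
import numpy as np

def steps_to_target(batch_size, curvatures, target, learning_rates, max_steps=100_000):
    """Fewest SGD steps, over all learning rates tried, for the expected loss
    of a noisy quadratic to reach `target` (None if it is never reached)."""
    h = np.asarray(curvatures, dtype=float)                 # curvature per dimension
    lr = np.asarray(learning_rates, dtype=float)[:, None]   # one row per learning rate
    second_moment = np.ones((lr.shape[0], h.size))          # E[x_i^2], starting at 1
    contraction = (1.0 - lr * h) ** 2                       # deterministic shrinkage per step
    noise = lr ** 2 * h / batch_size                        # assumed noise variance h_i / B
    for step in range(1, max_steps + 1):
        second_moment = contraction * second_moment + noise
        expected_loss = 0.5 * (second_moment * h).sum(axis=1)
        if expected_loss.min() <= target:                   # best learning rate got there
            return step
    return None

curvatures = [1.0, 0.05]                         # arbitrary, condition number 20
learning_rates = np.geomspace(1e-4, 1.9, 200)    # tuned separately per batch size
batch_sizes = [2 ** k for k in range(0, 21, 2)]
steps = {b: steps_to_target(b, curvatures, 0.01, learning_rates) for b in batch_sizes}

for b in batch_sizes:
    print(f"batch size {b:>8}: {steps[b]} steps")
```

In this model, small batches need a small learning rate to keep the noise floor below the target, so quadrupling the batch size cuts the step count roughly fourfold. Once the learning rate is capped by the largest curvature, bigger batches stop helping.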

Incidentally, this paper must have been absurdly expensive, even by Google's standards. Doing careful empirical work on optimizers requires many, many runs of the algorithm. (I think surprising phenomena on ImageNet are often due to the difficulty of running proper experiments.)

More from Machine learning

Pratham Prasoon
@PrasoonPratham

Machine Learning for the Web developer in 2021.

The beginner's guide.

🧵👇

I started machine learning as a web developer; if I can do it, then anyone can.

This carefully curated thread will give you key insights into my journey and how you can make this transition, seamlessly.

(2 / 14)

"Machine learning is not what you think it is"

One of the main reasons why people find it difficult to get started with machine learning is because of the lack of information, and rightfully so.

(3 / 14)

Machine learning as a concept has existed since the 1950s, but has only become popular in recent years because of the exponential rise of advancements in computer hardware.

(4 / 14)

In short, it is because of the sudden rise of this technology, which was previously unknown to the general public, that there is a lot of misinformation around it.

(5 / 14)

Pratham Prasoon
@PrasoonPratham

Which libraries do you really need to get started with Machine Learning and why?

🧵👇

First and foremost, make sure that you have nailed the fundamental concepts of Python because machine learning requires a lot of programming!

(2 / 19)

If you know these topics, then you are good to go with machine learning in Python

- Object-oriented programming in Python: Classes, Objects, Methods
- Lists & List functions
- List comprehension
- List slicing
- String formatting
- Lists, Dictionaries & Tuples

(3 / 19)
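As a quick self-check, here is a small sketch that touches each topic in the list above. The `Dataset` class and its values are made up for illustration.

```python
class Dataset:
    """A tiny class with one method, to exercise classes, objects and methods."""

    def __init__(self, name, values):
        self.name = name
        self.values = list(values)

    def mean(self):
        return sum(self.values) / len(self.values)

scores = Dataset("scores", [2, 4, 6, 8])

squares = [v * v for v in scores.values]                # list comprehension
first_two = scores.values[:2]                           # list slicing
summary = f"{scores.name}: mean={scores.mean():.1f}"    # string formatting
record = {"name": scores.name, "shape": (len(scores.values),)}  # dict holding a tuple

print(squares, first_two, summary, record)
```

If every line here reads naturally to you, the Python side should not hold you back.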

Now, let's understand what popular Python machine learning libraries do.

We will talk about👇
- TensorFlow (+ Keras)
- PyTorch
- Pandas
- Numpy
- Matplotlib
- SciKit Learn
- Seaborn

(4 / 19)

Now let's cover the libraries that you have to learn (at least the basics) for machine learning

1. Pandas

Pandas is a Python library that allows you to store and read data from spreadsheets (.csv, .xlsx files) in structures called DataFrames.

(5 / 19)
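A minimal sketch of what that looks like, assuming pandas is installed. The CSV text is made-up data, read from an in-memory string so the example runs without any file on disk.

```python
import io

import pandas as pd

# Made-up spreadsheet contents; with a real file you would pass its path instead.
csv_text = "bedrooms,square_feet,price\n2,850,200000\n3,1200,310000\n"

houses = pd.read_csv(io.StringIO(csv_text))  # parse the CSV into a DataFrame

print(houses.shape)            # (rows, columns)
print(houses["price"].mean())  # average of one column
```

Excel files go through `pd.read_excel` in the same way, which needs an extra Excel engine package installed.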


Santiago
@svpino

For a long time, I didn't understand how to use Virtual Environments in Python 🐍.

If this is you, let's end it here and now: 🧵👇

[2] Virtual Environments let you deal with the dependencies that your code has with external Python libraries.

It avoids having conflicts when your projects depend on different versions of the same library.

👇

[3] Let's imagine that you are building your first Python project and you install the "requests" library:

pip install requests

You get version 2.24.0 installed in your system.

👇

[4] A month later, you decide to work on your second project. It also needs the "requests" library.

But the latest version is not 2.24.0 anymore.

Now version 3 is available, and that's the one you want to use!

👇

[5] You could upgrade your entire system to version 3, but then you'll be potentially breaking the first project you built that depends on 2.24.0!

Can you imagine this happening on a server with many more applications running?
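The idea of one environment per project can be sketched with Python's standard `venv` module. The project names are hypothetical, and `with_pip=False` is used only to keep the sketch fast and offline. In real use you would create each environment with pip and install that project's own version of "requests" inside it.

```python
import tempfile
import venv
from pathlib import Path

def make_env(project_dir: Path) -> Path:
    """Create an isolated environment in <project_dir>/.venv and return its path."""
    env_dir = project_dir / ".venv"
    venv.create(env_dir, with_pip=False)  # with_pip=True would also bootstrap pip
    return env_dir

with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp)
    env_one = make_env(root / "project_one")  # would hold the older library version
    env_two = make_env(root / "project_two")  # would hold the newer library version
    # Each environment has its own config file and its own site-packages.
    created = [(env / "pyvenv.cfg").exists() for env in (env_one, env_two)]

print(created)
```

Because each project has its own environment, upgrading a library in one never touches the other.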

👇


Fionna O'Leary, 🕯🇪🇺...
@fascinatorfun

Thanks for this incredibly helpful analysis @dgurdasani1

Two questions. 1/ Does this summarise the AZ published data :
The plan is to extend the time interval for all age groups despite it being largely untested on the over 55yrs, although the full data is not yet published

SUMMARY: the Oxford/Astra trial examined dosing with gaps between 4-12 wks- although longer gaps appear to be limited mostly to younger participants. There was no difference reported in published data between these & efficacy from the 1st dose seems high for severe disease.
— Deepti Gurdasani (@dgurdasani1) December 31, 2020

Do we have the actual numbers of over 55yr olds given a 2nd dose at c12 weeks and the accompanying efficacy data?

Not to mention the efficacy data of the full first dose over that same period?

I’d quite like to know whether I am to be a guinea pig & the ongoing risks to manage

You attached photos of excerpts from a paper. Could you attach the link?

Re Pfizer. As I understand it the most efficacious interval for dosing was investigated at the start of the trial.

Discussions of 1 vs 2 doses suggest many are not aware of Pfizer's trials which evaluated 1 vs 2 dose immunogenicity, assessed multiple formulations (BNT162b1 BNT162b2 etc) & conducted dose-ranging in both young & old adults at the start. Saw "clear benefit of booster at day 21" pic.twitter.com/mpyxu9xFSF
— Dr Nicole E Basta (@IDEpiPhD) December 31, 2020

Here’s the link to the

I’ve got to say that this way of making and announcing decisions is not inspiring confidence in me and I am very pro vaccination as a matter of principle, not least because my brother caught polio before vaccinations available.

Santiago
@svpino

11 key concepts of Machine Learning.

— Supervised Learning Edition —

🧵👇

😜

Before starting, remember that, if you follow me, one of your enemies will be immediately destroyed (and you'll get to read more of these threads, of course.)

And if you don't follow me, well, you just hurt my feelings.

😜

1. Labels

(Also referred to as "y")

The label is the piece of information that we are predicting.

For example:

- the animal that's shown in a picture
- the price of a house
- whether a message is spam or not

👇

2. Features

(Also referred to as "x")

These are the input variables to our problem. We use these features to predict the "label."

For example:

- pixels of a picture
- number of bedrooms of a house
- square footage of a house

👇

3. Samples

(This is also known as "examples.")

A sample is a particular instance of data (features or "x.") It could be "labeled" or "unlabeled."
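The three terms fit together as in the sketch below. The house data is made up, and the `X` / `y` names simply follow the convention mentioned above.

```python
# Each labeled sample pairs its features with a label (here, the house price).
labeled_samples = [
    ({"bedrooms": 2, "square_feet": 850}, 200_000),
    ({"bedrooms": 3, "square_feet": 1200}, 310_000),
]

# An unlabeled sample has features only; its price is what we would predict.
unlabeled_sample = {"bedrooms": 4, "square_feet": 1500}

# Split the labeled samples into a feature matrix X and a label vector y.
X = [[features["bedrooms"], features["square_feet"]] for features, _ in labeled_samples]
y = [label for _, label in labeled_samples]

print(X)
print(y)
```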

👇

You May Also Like

Jaya_Upadhyaya
@Jayalko1

MEENAKSHI AMMAN, MADURAI (TN)
#bharatmandir #navratri2021

A paadal Petra sthalam where Shiva took the form of Sundareswarar (the handsome one) and married Devi Parvati (Meenakshi).
Devi is also known by the name Angayarkanni (mother with the beautiful fish eyes).
@GunduHuDuGa

Devi Meenakshi emerged from yagna fire as a 3 year old girl when Pandyan King Malayadwaja and Kanchanamalai were praying for a child.
It is said that Devi was born with three breasts and there was a prophecy that her superfluous breast would melt away when she met her husband.

Devi ruled over Madurai and captured Indralok. She went on to capture Kailasha. When she saw Shiva, her 3rd breast disappeared and she realised that Shiva would be her consort.
The divine marriage was attended by all Devas and Sri Vishnu (as her brother) gave her hand to Shiva.

It is said that Indra found a swayambhu lingam at Kadamba Vanam. He placed it in Madurai where the kshetram stands. Here, Shiva is seen on the vehicle of Indra.
The Golden Lotus Tank is said to be the place where a golden lotus blossomed for the puja performed by Indra.

The Meenakshi Thirukalyanam festival is celebrated in the Chithirai month to mark the divine marriage of Meenakshi Amman. The festival includes a procession where Meenakshi and Sundareshwara travel in a chariot and Sri Vishnu gives away his sister in marriage to Shiva.

Noah Smith
@Noahpinion

1/OK, data mystery time.

This New York Times feature shows China with a Gini Index of less than 30, which would make it more equal than Canada, France, or the Netherlands. https://t.co/g3Sv6DZTDE

That's weird. Income inequality in China is legendary.

Let's check this number.

2/The New York Times cites the World Bank's recent report, "Fair Progress? Economic Mobility across Generations Around the World".

The report is available here:

3/The World Bank report has a graph in which it appears to show the same value for China's Gini - under 0.3.

The graph cites the World Development Indicators as its source for the income inequality data.

4/The World Development Indicators are available at the World Bank's website.

Here's the Gini index: https://t.co/MvylQzpX6A

It looks as if the latest estimate for China's Gini is 42.2.

That estimate is from 2012.

5/A Gini of 42.2 would put China in the same neighborhood as the U.S., whose Gini was estimated at 41 in 2013.

I can't find the <30 number anywhere. The only other estimate in the tables for China is from 2008, when it was estimated at 42.8.
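For readers unfamiliar with the measure, here is a minimal sketch of how a Gini coefficient is computed from a list of incomes. The incomes are made up. The coefficient runs from 0 to 1, and multiplying it by 100 gives the index scale, which is why "under 0.3" and "less than 30" describe the same value.

```python
def gini(incomes):
    """Gini coefficient of a list of non-negative incomes (0 = perfect equality)."""
    values = sorted(incomes)
    n = len(values)
    total = sum(values)
    # Rank-weighted sum over the sorted incomes, with ranks starting at 1.
    weighted = sum((2 * rank - n - 1) * x for rank, x in enumerate(values, start=1))
    return weighted / (n * total)

equal = gini([1, 1, 1, 1])         # everyone earns the same
concentrated = gini([0, 0, 0, 1])  # one person earns everything

print(equal, concentrated)
print(100 * concentrated)          # same quantity on the 0-100 index scale
```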



MaMaMia 🇬🇷🇨🇾🇦🇲 Support...
@MaMaMia73189983

Following @BAUDEGS I have experienced hateful and propagandist tweets time after time. I have been shocked that an academic community would be so reckless with their publications. So I did some research.
The question is:
Is this an official account for Bahcesehir Uni (Bau)?

Bahcesehir Uni, BAU has an official website https://t.co/ztzX6uj34V which links to their social media, leading to their Twitter account @Bahcesehir

BAU’s official Twitter account

BAU has many departments, which all have separate accounts. Nowhere among them did I find @BAUDEGS
@BAUOrganization @ApplyBAU @adayBAU @BAUAlumniCenter @bahcesehirfbe @baufens @CyprusBau @bauiisbf @bauglobal @bahcesehirebe @BAUintBatumi @BAUiletisim @BAUSaglik @bauebf @TIPBAU

Nowhere among them was @BAUDEGS to be found

Jaya_Upadhyaya
@Jayalko1

GHATI SUBRAMANYA, DODDABALLAPURA (KAR)
#bharatmandir

One of the most important places for snake worship.
The murtis of Kartikeya and Narasimha are self-manifested.
There is an anthill which is opposite the shrine. Bhaktas pour milk on it as part of rituals.

Here Subramanya performed penance in the form of a snake. He sought protection from Narasimha for the Nagas from Sri Vishnu’s vehicle Garuda, known for his dislike of serpents.
The Pushya Suddha Shasti is one of the biggest festivals here, believed to be the appearance day of Subramanya.

It was in this region where Subramanya vanquished Ghatikasura, the demon. Hence called Ghati Subramanya.
Childless couples visit here to seek blessings for getting a child.
The installation of snake murtis near the shrine is believed to be an auspicious act.

The murti of Karthikeya with a seven headed cobra faces eastwards while Narasimha faces westwards.
To ensure that both are visible to bhaktas at the same time, a huge mirror is placed in the rear in the sanctum sanctorum.

Simon DeDeo
@SimonDeDeo

"I lied about my basic beliefs in order to keep a prestigious job. Now that it will be zero-cost to me, I have a few things to say."

As a dean of a major academic institution, I could not have said this. But I will now. Requiring such statements in applications for appointments and promotions is an affront to academic freedom, and diminishes the true value of diversity, equity of inclusion by trivializing it. https://t.co/NfcI5VLODi
— Jeffrey Flier (@jflier) November 10, 2018

We know that elite institutions like the one Flier was in (partial) charge of rely on irrelevant status markers like private school education, whiteness, legacy, and ability to charm an old white guy at an interview.

Harvard's discriminatory policies are becoming increasingly well known, across the political spectrum (see, e.g., the recent lawsuit on discrimination against East Asian applicants.)

It's refreshing to hear a senior administrator admit to personally opposing policies that attempt to remedy these basic flaws. These are flaws that harm his institution's ability to do cutting-edge research and to serve the public.

Harvard is being eclipsed by institutions that have different ideas about how to run a 21st Century institution. Stanford, for one; the UC system; the "public Ivys".