data-science

There's a small mistake in the description of the embedding layer. It says

'Turns positive integers (indexes) into dense vectors of fixed size.'

but it should read

'Turns non-negative integers (indexes) into dense vectors of fixed size.'

as it expects indexes ranging from 0 to input_dim - 1.

Currently, using OrdinalEncoder with a string-valued feature, and without categories explicitly specifying an order, means that OrdinalEncoder will number the categories according to their lexicographic ordering.

This is not appropriate if the categories have a natural ordering (e.g. ['Green', 'Amber', 'Red']) that can be harnessed by the downstream estimator.

Rather, we should allow the u

I got a conllU file, from my university, where the head column is filled with .
Processing such file with the cli.convert method will result in a int cast error in
https://github.com/explosion/spaCy/blob/master/spacy/cli/converters/conllu2json.py line 73
in the read_conllx method (head = (int(head) - 1) if head != "0" else id).

In the format documentation on https://universaldependencie

Since #11953 was merged a couple of extra simplification can be done:

See in particular this comments.

https://github.com/ipython/ipython/pull/11953/files#r348243309

The value DICT_IS_ORDERED in IPython/lib/pretty.py is always True; any code that reply on it can be simplified; and the value should be documented for future removal.

This is a good issue for a first time contri

I tried to use latex in dash, but it is not working.
It seems that the mathjax javacript library is not loaded.

The usage example in the word2vec.py doc-comment regarding KeyedVectors uses inconsistent paths and thus doesn't work.

https://github.com/RaRe-Technologies/gensim/blob/e859c11f6f57bf3c883a718a9ab7067ac0c2d4cf/gensim/models/word2vec.py#L73

https://github.com/RaRe-Technologies/gensim/blob/e859c11f6f57bf3c883a718a9ab7067ac0c2d4cf/gensim/models/word2vec.py#L76

If vectors were saved to a tm

System information

OS Platform and Distribution (e.g., Linux Ubuntu 16.04): Ubuntu 16.04
Ray installed from (source or binary): pip install ray
Ray version: 0.7.4
Python version: 3.6

Describe the problem

After having successfully trained and restore an agent, one very common use case might be to make deterministic action given a state. After training, or wh

Question

Can PretrainedTransformerTokenizer track character offset like WordTokenizer？
Since character offset is important to calculate answer span after wordpiece tokenization？

i'm a newbie in programming. I try to use this library. it's very useful for me.
I want to show centroid in K-means clustering. how to show it? thank u so much..

When pressing the Enter key in the Wikidata login form from the Wikidata extension, one would expect the form to be submitted, which currently does not happen.

@anargyri

Description

from a conversation with @anargyri:

It would be more appropriate to have a folder called tuning and, under that folder, azureml and nni and the spark tuning code. This would require testing again all the tuning notebooks, so I would leave it for a separate PR. Hyperdrive and NNI rely on several path names, so they will brea

I can not find a guide on choosing TPOT parameters. I know the API is explained in the documents but its too brief. TPOT seems made for users unskilled in ML and GP. I made another issue with my many questions. "We recommend using the default parameter unless you understand how the mutation rate affects GP algorithms. " should have a link.

Today you can put Streamlit in "wide mode" via the Settings dialog in the UI. However, it would be great if the wide mode setting were sticky.

Option 1: just make Wide Mode sticky by persisting it in local storage!

Option 2: Provide a config option that toggles wide mode:

[browser]
wideMode = True

(for this we'd have to replicate much of the code used to propagate settin

Would be great to have new option in Pool. Just like cat_features list of numbers or column names.

Jul	NOV	Dec
	23
2018	2019	2020

data-science

Here are 9,084 public repositories matching this topic...

keras-team / keras

scikit-learn / scikit-learn

CamDavidsonPilon / Probabilistic-Programming-and-Bayesian-Methods-for-Hackers

donnemartin / data-science-ipython-notebooks

explosion / spaCy

ipython / ipython

eriklindernoren / ML-From-Scratch

virgili0 / Virgilio

academic / awesome-datascience

plotly / dash

rasbt / python-machine-learning-book

RaRe-Technologies / gensim

hangtwenty / dive-into-machine-learning

afshinea / stanford-cs-229-machine-learning

ray-project / ray

System information

Describe the problem

tflearn / tflearn

bharathgs / Awesome-pytorch-list

onurakpolat / awesome-bigdata

allenai / allennlp

php-ai / php-ml

OpenRefine / OpenRefine

microsoft / recommenders

Description

EpistasisLab / tpot

Yorko / mlcourse.ai

lexfridman / mit-deep-learning

streamlit / streamlit

rasbt / python-machine-learning-book-2nd-edition

mahmoud / boltons

rushter / data-science-blogs

catboost / catboost

Related topics