Dev by ANugmanova · Pull Request #1 · ANugmanova/train_model

ANugmanova · 2017-12-20T19:08:59Z

No description provided.

andreibsmirnov · 2017-12-20T20:58:01Z

+
+train = pd.read_csv(os.path.join(os.path.dirname(__file__), 'data', 't.csv'), header=0, delimiter="\t", quoting=3)  #открывается обучающий датасет
+
+train = train[:5000]


а почему и здесь и при обучении deepmoji все только по 5000 твит обрезается

andreibsmirnov · 2017-12-20T20:58:55Z

+model = LinearSVC(penalty='l2', loss='squared_hinge', dual=True, tol=0.0001, C=1.0, multi_class='ovr', 
+                  fit_intercept=True, intercept_scaling=1, class_weight=None, verbose=0, random_state=None, max_iter=1000)
+model.fit(X, y)
+print ("20 Fold CV Score. Bag of words: ", np.mean(cross_validation.cross_val_score(model, X, y, cv=20, scoring='roc_auc')))


другая модель без кроссвалидации ведь проверяется?

Да, но это просто старый код, я его не меняла.

andreibsmirnov · 2017-12-20T21:14:01Z

-
+def train_model(nb_classes, DATASET_PATH, DATASET_PATH_PRETRAINED = '',
+                PRETRAINED_PATH='', delete_non_raws = False, save_model = False):
+    vocab = {


а что вот это за слова, кстати?

это те теги, которые добавляются в препроцессинге у авторов

А для русского они тоже нужны?

ну вот CUSTOM_URL и CUSTOM_NUMBER не зависят от языка, но по идее нужно будет проверить

andreibsmirnov · 2017-12-20T21:14:41Z

+
+def review_to_wordlist( review, remove_stopwords=False ):
+    # review_text = BeautifulSoup(review).get_text()
+    review_text = review


andreibsmirnov · 2017-12-20T21:15:47Z

            df['sent'].append(emoji_dict[emoji_name])
    return df

 df = {'text':[], 'id':[], 'sent':[]}


if name == 'main'

вот ты зануда)

А вот и нет. Я импортнул отсюда словарь и у меня вышла ошибка, что какого-то файлика не хватает

А, ну я его для других целей создавала просто)

ANugmanova added 3 commits December 17, 2017 17:00

test

48dc3c4

add pretrained model and fixed errors in train

b3e482e

test svm

78b3868

andreibsmirnov reviewed Dec 20, 2017

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Dev#1

Dev#1
ANugmanova wants to merge 3 commits intomasterfrom
dev

ANugmanova commented Dec 20, 2017

Uh oh!

andreibsmirnov Dec 20, 2017

Uh oh!

andreibsmirnov Dec 20, 2017

Uh oh!

ANugmanova Dec 20, 2017

Uh oh!

andreibsmirnov Dec 20, 2017

Uh oh!

ANugmanova Dec 20, 2017

Uh oh!

andreibsmirnov Dec 20, 2017

Uh oh!

ANugmanova Dec 20, 2017

Uh oh!

andreibsmirnov Dec 20, 2017

Uh oh!

andreibsmirnov Dec 20, 2017

Uh oh!

ANugmanova Dec 20, 2017

Uh oh!

andreibsmirnov Dec 20, 2017

Uh oh!

ANugmanova Dec 20, 2017

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants


		train = pd.read_csv(os.path.join(os.path.dirname(__file__), 'data', 't.csv'), header=0, delimiter="\t", quoting=3) #открывается обучающий датасет

		train = train[:5000]

Conversation

ANugmanova commented Dec 20, 2017

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants