Word_Random_Generator/Word_Random_Generator_en.py at master · Stillrot/Word_Random_Generator · GitHub

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
import pandas as pd
import numpy as np
import random

global Length, word_number, word_collected, word_corpus, count, Day

Day=6
# engine='python'은 csv파일에 한글이 있는 경우에도 파일을 읽어올 수 있게 해주는 속성이다.
df = pd.read_csv("../Word_Random_Generator/wordtest_en.csv", header=None, engine='python')
'''
행의 nan의 개수를 나타내준다. df.isnull().sum(1) 자세한 설명은 https://rfriend.tistory.com/260?category=675917
이를 nan_temp에 넣어주고 nan_num으로 넘파이 배열로 바꾸어준다.
nan_num에는 각 행의 nan의 개수가 담겨 있으므로 총 Length에서 빼주면 된다.
'''
nan_temp=df.isnull().sum(1)
nan_num=np.array(nan_temp)

dataset=df.values

Length=len(dataset[Day-1])-nan_num[Day-1]

word_number=[]

for i in range(1,Length+1):
    word_number.append(i)

'''
word_temp에 원하는 Day의 단어들을 넣어준다.
이때 원하는 행과 Length까지만의 열을 가지고 와야한다. 그 외에는 nan값이 있어 zip이 되지 않는다.
word_corpus를 넘파이 배열로 바꾸어 준 후 reshape 해준다.
2차원 배열을 1차원 배열로 바꾸어 주기 때문에 reshape(Length)로 사용한다.
'''
word_temp=dataset[Day-1:Day,:Length]
word_collected=np.array(word_temp)
word_collected=word_collected.reshape(Length)

word_corpus=[]
temp=[]
for x,y in zip(word_number,word_collected):
    temp=[x,y]
    word_corpus.append(temp)
print(word_corpus)

'''
저장할 파일 이름을 Day와 합쳐서 File_name.hwp로 만들어 준다.
'''
File_name="%s%2d %s.hwp" %("Day",Day,"Test")
f=open(File_name,'w',encoding="utf8")

title="%s%2d %s\n\n" %("Day",Day,"Test")
f.write(title)

'''
i=randint(0,Length-1)를 생성해주어 number_temp에 i가 있다면 count를 1증가시키고
아닌 경우 word_corpus[i]에 있는 번호와 단어를 데이터에 넣어 준다. 그후 number_temp에 i를 추가 시켜준다.
'''
count=0
number_temp=[]
while len(number_temp)<Length:
    i=random.randint(0,Length-1)
    if i in number_temp:
        count+=1
    else:
        print("%2d. %s:" % (word_corpus[i][0],word_corpus[i][1]))
        data="%2d. %s:\n\n" % (word_corpus[i][0],word_corpus[i][1])
        f.write(data)
        number_temp.append(i)
f.close()