Comments

BalorNG t1_jcqgc4x wrote

It has 6B parameters, but I bet it cannot answer what happened in Tiananmen Square in 1989 :3

71

username001999 t1_jcrn1aq wrote

We Americans live in a country where kids are regularly gunned down in school so we make ourselves feel better by making jokes about how much worse other countries are for events that happened over 30 years ago. Or we don’t even know our own history, like the Kent State Massacre.

−7

Quail-That t1_jcsgkc4 wrote

Not knowing and not being allowed to know are radically different things. If you want to conflate the two, you are acting in bad faith.

29

username001999 t1_jcsr60y wrote

Can you read Chinese? If so, you can read all about the Tiananmen protest on the Chinese internet or talk to actual Chinese citizens about it on WeChat.

6

xerca t1_jcsnz4j wrote

And derailing any topic that comes out of China into Tiananmen Square is not acting in bad faith? Especially given that the American company "Open"AI is heavily guarding and paywalling their models while this Chinese group is sharing theirs with the world for everyone to use.

Conflating anything that comes out of a country with 1.5 billion people with your incredibly shallow knowledge of history only serves to demonstrate your ignorance.

3

extopico t1_jcsmc8t wrote

Oh look, a wumao deploys whataboutism!

4

username001999 t1_jcsr1pi wrote

lol, whining about whataboutism is the last refuge of hypocrites.

2

extopico t1_jcsuio8 wrote

What? No it’s not. Pointing out blatant whataboutism is always independently valid.

Why would you even write what you wrote? Is it a required riposte that’s included in your briefing file, or training?

4

BawkSoup t1_jcsgyoe wrote

Okay, tankie. Keep it about machine learning.

2

BalorNG t1_jcsy0rl wrote

Technically, I'm from Russia.

And, of course, you are able to read every opinion about the "special military operation" here... sometimes even without a VPN. It's just that voicing a "different one" can get you years in prison and your kids into a foster home for reindoctrination. While the programmers who coded it might have a range of diverse opinions on this and other "politically sensitive" subjects, if they want their program to pass inspection in China, they WILL have to do considerable fine-tuning to throw away sensitive data, if the front page of our Russian Google (Yandex) is any indication. If this is a foundation model without fine-tuning, that's a different matter though... but then it will hallucinate nonstop and produce "fakes" anyway...

0

CommunicationLocal78 t1_jcqw9zq wrote

There are a lot fewer forbidden topics in China than in the West.

−60

gronaninjan t1_jcr6746 wrote

Name one

27

redpandabear77 t1_jcrng6h wrote

Name one forbidden topic in China that doesn't have to do with criticizing the government.

−10

GaggiX t1_jcrvtz6 wrote

I mean, even the Taiwan flag emoji is banned on Chinese phones lmao

21

the320x200 t1_jcqxqrs wrote

Please... That's ridiculous. Name one historical event people in the west are afraid to even admit to knowing about in public.

5

Riboflavius t1_jcs7afw wrote

Pretty sure whoever knows what happened to Jimmy Hoffa made sure they kept their trap shut in public… ;)

7

NotARedditUser3 t1_jcr0vsb wrote

You'd know exactly how you were wrong if those topics weren't forbidden and you'd actually heard about them

−13

farmingvillein t1_jcsnx0f wrote

"open source".

That license, lol:

> You will not use, copy, modify, merge, publish, distribute, reproduce, or create derivative works of the Software, in whole or in part, for any commercial, military, or illegal purposes.

> You will not use the Software for any act that may undermine China's national security and national unity, harm the public interest of society, or infringe upon the rights and interests of human beings.

> This license shall be governed and construed in accordance with the laws of People’s Republic of China. Any dispute arising from or in connection with this License shall be submitted to Haidian District People's Court in Beijing.

What a nightmare.

40

clueless1245 t1_jcteoda wrote

Open source doesn't mean freely used. That's the whole reason there's an F in FOSS.

3

sanxiyn t1_jcw2yoz wrote

On the other hand, a commercial-use restriction is not compatible with the generally accepted definition of open source, for example The Open Source Definition.

> 6) No Discrimination Against Fields of Endeavor. The license must not restrict anyone from making use of the program in a specific field of endeavor. For example, it may not restrict the program from being used in a business, or from being used for genetic research.

7

evangelion-unit-two t1_jcu2o0f wrote

What are they going to do if I violate it? Cry like a baby?

3

Art10001 t1_jcweykf wrote

> Any dispute arising from or in connection with this License shall be submitted to Haidian District People's Court in Beijing.

3

MysteryInc152 OP t1_jcputc0 wrote

Uses relative positional encoding. Long context in theory but because it was trained on 2048 tokens of context, performance gradually declines after that. Finetuning for more context wouldn't be impossible though.

You can run it with FP16 (13 GB RAM), 8-bit (10 GB), or 4-bit (6 GB) quantization.
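For a rough sanity check on those numbers, here is a back-of-the-envelope sketch of the memory needed for the weights alone. It assumes a round 6e9 parameters, which is my simplification of the "6B" figure, and it ignores activations, caches, and any layers left unquantized. That is presumably why the quoted figures come out higher than these.

```python
# Weight-only memory estimate at different precisions.
# Assumption: exactly 6e9 parameters; real footprints are larger because
# activations and runtime overhead are not counted here.

def weight_memory_gb(num_params: float, bits_per_param: int) -> float:
    """Return the size of the weights alone, in decimal gigabytes."""
    return num_params * bits_per_param / 8 / 1e9

NUM_PARAMS = 6e9  # assumed round figure for a "6B" model

estimates = {
    name: weight_memory_gb(NUM_PARAMS, bits)
    for name, bits in [("FP16", 16), ("8-bit", 8), ("4-bit", 4)]
}

for name, gb in estimates.items():
    print(f"{name}: ~{gb:.0f} GB for weights alone")
```

So the weights are a lower bound of roughly 12, 6, and 3 GB respectively, and the gap to the quoted requirements is everything else the runtime needs.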

36

Temporary-Warning-34 t1_jcpwu9p wrote

'Feedback bootstrap'. Lol.

Sorry. What does that mean?

9

MysteryInc152 OP t1_jcpzgd4 wrote

Bootstrapping is basically taking a model's best/better outputs on a certain task and finetuning on that.

EDIT: Seems I'm wrong on that

15

MisterManuscript t1_jcry6cj wrote

That's not what bootstrapping is. It is a resampling technique used to create multiple datasets of the same size from the original dataset using random sampling with replacement. It is done to estimate the standard deviation of a desired variable.

Here's the link to the ISLR textbook. The bootstrap chapter will verify what it is.
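To make the definition concrete, here is a minimal sketch of the statistical bootstrap using only the Python standard library. The sample values, the function name `bootstrap_standard_error`, and the choice of 2000 resamples are made up for illustration.

```python
import random
import statistics

def bootstrap_standard_error(data, statistic, n_resamples=2000, seed=0):
    """Estimate the standard error of `statistic` by resampling `data`."""
    rng = random.Random(seed)
    n = len(data)
    estimates = []
    for _ in range(n_resamples):
        # Draw a dataset of the same size, sampling with replacement.
        resample = rng.choices(data, k=n)
        estimates.append(statistic(resample))
    # The spread of the statistic across resamples is the bootstrap SE.
    return statistics.stdev(estimates)

# Hypothetical sample, for illustration only.
sample = [2.1, 3.4, 1.9, 5.6, 4.2, 3.8, 2.7, 4.9, 3.1, 4.4]
se = bootstrap_standard_error(sample, statistics.mean)
print(f"bootstrap SE of the mean: {se:.3f}")
```

For the mean you could get the standard error analytically, but the same function works for statistics with no closed form, such as `statistics.median`.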

17

MysteryInc152 OP t1_jcrz16i wrote

Yeah I'm wrong it seems. Read a few articles using bootstrapping in the definition I used so I assumed that was generally it.

6

relevantmeemayhere t1_jcrotun wrote

Mm, not really.

Bootstrapping is used to determine the standard error of estimates using resampling. From here we can derive tools like confidence intervals, or other interval estimates.

Generally speaking you do not use the bootstrap to tweak the parameters of your model. You use cross validation to do so.
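As a contrast with the bootstrap, here is a small sketch of k-fold cross validation used to pick a hyperparameter. Everything in it is an illustrative assumption: the synthetic data, the toy 1-D nearest-neighbour regressor, and the candidate values for `n_neighbors`.

```python
import random

def k_fold_indices(n, k, seed=0):
    """Shuffle indices 0..n-1 and split them into k disjoint folds."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]

def knn_predict(train, x, n_neighbors):
    """Average the targets of the nearest training points (1-D inputs)."""
    nearest = sorted(train, key=lambda p: abs(p[0] - x))[:n_neighbors]
    return sum(y for _, y in nearest) / len(nearest)

def cv_error(data, n_neighbors, k=5, seed=0):
    """Mean squared error over held-out folds."""
    total = 0.0
    for fold in k_fold_indices(len(data), k, seed):
        held_out = set(fold)
        train = [p for i, p in enumerate(data) if i not in held_out]
        for i in fold:
            x, y = data[i]
            total += (knn_predict(train, x, n_neighbors) - y) ** 2
    return total / len(data)

# Synthetic noisy quadratic, for illustration only.
rng = random.Random(1)
data = [(x / 10, (x / 10) ** 2 + rng.gauss(0, 0.3)) for x in range(50)]

scores = {nn: cv_error(data, nn) for nn in (1, 3, 5, 10, 20)}
best = min(scores, key=scores.get)
print(scores, "best n_neighbors:", best)
```

Each candidate is scored only on data it did not train on, which is what makes cross validation suitable for tuning, whereas the bootstrap answers how uncertain an estimate is.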

10

relevantmeemayhere t1_jcrp2rr wrote

Honestly, really comes off as word salad lol.

I haven’t read the details, but it sounds like resampling in a serial learner?

6

visarga t1_jctfir1 wrote

Human feedback is being bootstrapped by GPT-3 predictions "stolen" against OpenAI's will (for just $500 in API bills).

1

MisterManuscript t1_jcrwvc8 wrote

I tried googling it; it's a nonexistent term in the realm of statistics. I know what bootstrapping is, but not this version of it.

It's better to ask the GitHub authors about this to make sure they're not just spitting out pseudostatistical terminology.

Addendum: another guy did query the authors regarding this terminology in the issues tab, they did not respond.

2

wyhauyeung1 t1_jcrvnxz wrote

I successfully deployed it on my local PC and ran it. Just wondering, where is the model file stored after installation? I could not find any big files under the directory.
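One way to hunt for the weights is to list the largest files under a likely download location. The sketch below assumes the model was fetched through the Hugging Face libraries, which the thread does not confirm. In that case the default cache is `~/.cache/huggingface/hub` unless `HF_HUB_CACHE` or `HF_HOME` is set. The helper names are made up for illustration.

```python
import os
from pathlib import Path

def default_cache_dir():
    """Best guess at the Hugging Face hub cache (an assumption, see above)."""
    if "HF_HUB_CACHE" in os.environ:
        return Path(os.environ["HF_HUB_CACHE"])
    hf_home = os.environ.get("HF_HOME", Path.home() / ".cache" / "huggingface")
    return Path(hf_home) / "hub"

def largest_files(root, top_n=5):
    """Return (size_in_bytes, path) for the biggest regular files under root."""
    sizes = []
    for path in Path(root).rglob("*"):
        # Skip symlinks so files the cache links to are not counted twice.
        if path.is_file() and not path.is_symlink():
            sizes.append((path.stat().st_size, path))
    sizes.sort(key=lambda item: item[0], reverse=True)
    return sizes[:top_n]

cache = default_cache_dir()
if cache.exists():
    for size, path in largest_files(cache):
        print(f"{size / 1e9:6.2f} GB  {path}")
else:
    print(f"No cache found at {cache}; the weights may be stored elsewhere.")
```

If nothing large shows up there, point `largest_files` at your home directory or the project folder instead.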

3