Comments
username001999 t1_jcrn1aq wrote
We Americans live in a country where kids are regularly gunned down in school so we make ourselves feel better by making jokes about how much worse other countries are for events that happened over 30 years ago. Or we don’t even know our own history, like the Kent State Massacre.
Quail-That t1_jcsgkc4 wrote
Not knowing and not being allowed to know are radically different things. If you want to conflate the two, you are acting in bad faith.
username001999 t1_jcsr60y wrote
Can you read Chinese? If so, you can read all about the Tiananmen protest on the Chinese internet or talk to actual Chinese citizens about it on WeChat.
xerca t1_jcsnz4j wrote
And derailing any topic that comes out of China into Tiananmen square is not acting bad faith? Especially given that the American company "Open"AI is heavily guarding and paywalling their models while this Chinese group is sharing theirs with the world for everyone to use.
Conflating anything that comes out of a country with 1.5 billion people with your incredibly shallow knowledge of history only serves to demonstrate your ignorance.
extopico t1_jcsmc8t wrote
Oh look a wumao deploys wahtaboutism!
username001999 t1_jcsr1pi wrote
lol, whining about whataboutism is the last refuge of hypocrites.
extopico t1_jcsuio8 wrote
What? No it’s not. Pointing out blatant whataboutism is always independently valid.
Why would you even write what you wrote? Is it a required riposte that’s included in your briefing file, or training?
BawkSoup t1_jcsgyoe wrote
Okay, tankie. Keep it about machine learning.
BalorNG t1_jcsy0rl wrote
Technically, I'm from Russia.
And, of course, you are able to read every opinion about "special military operation" here... sometimes even without VPN. It is just voicing a "different one" can get you for years into prison and your kids into a foster home for reindocrination. While the programmers that coded it might have a range diverse opinions on this and other "politically sensitive" subjects, if they would want their programm to pass inspection in China, they WILL have to do considerable fine-tuning to throw away sensitive data, if our Russian google (Yandex) frontpage is of any indictation. If this is a foundational model w/o finetunnig that's a different matter tho... but that it will hallucinate nonstop and produce "fakes" anyway...
gkaykck t1_jcrel1c wrote
Not cool
evangelion-unit-two t1_jcu2fw9 wrote
Tankie detected
[deleted] t1_jcyxnc3 wrote
[removed]
CommunicationLocal78 t1_jcqw9zq wrote
There are a lot fewer forbidden topics in China than in the West.
gronaninjan t1_jcr6746 wrote
Name one
redpandabear77 t1_jcrng6h wrote
Name one forbidden topic in China that doesn't have to do with criticizing the government.
GaggiX t1_jcrvtz6 wrote
I mean, even the Taiwan flag emoji is banned on Chinese phones lmao
[deleted] t1_jcsam76 wrote
[removed]
endless_sea_of_stars t1_jcrv26g wrote
Outside of criticizing government or religion can you name an illegal topic anywhere?
[deleted] t1_jcsaswp wrote
[removed]
the320x200 t1_jcqxqrs wrote
Please... That's ridiculous. Name one historical event people in the west are afraid to even admit to knowing about in public.
Riboflavius t1_jcs7afw wrote
Pretty sure whoever knows what happened to Jimmy Hoffa made sure they kept their trap shut in public… ;)
NotARedditUser3 t1_jcr0vsb wrote
You'd know exactly how you were wrong if those topics weren't forbidden and you'd actually heard about them
[deleted] t1_jcr1g7n wrote
[removed]
farmingvillein t1_jcsnx0f wrote
"open source".
That license, lol:
> You will not use, copy, modify, merge, publish, distribute, reproduce, or create derivative works of the Software, in whole or in part, for any commercial, military, or illegal purposes.
> You will not use the Software for any act that may undermine China's national security and national unity, harm the public interest of society, or infringe upon the rights and interests of human beings.
> This license shall be governed and construed in accordance with the laws of People’s Republic of China. Any dispute arising from or in connection with this License shall be submitted to Haidian District People's Court in Beijing.
What a nightmare.
clueless1245 t1_jcteoda wrote
Open source doesn't mean freely used. That's the whole reason there's an F in FOSS.
sanxiyn t1_jcw2yoz wrote
On the other hand, commercial use restriction is not compatible with generally accepted definition of open source, for example The Open Source Definition.
> 6) No Discrimination Against Fields of Endeavor. The license must not restrict anyone from making use of the program in a specific field of endeavor. For example, it may not restrict the program from being used in a business, or from being used for genetic research.
evangelion-unit-two t1_jcu2o0f wrote
What are they going to do if I violate it? Cry like a baby?
Art10001 t1_jcweykf wrote
> Any dispute arising from or in connection with this License shall be submitted to Haidian District People's Court in Beijing.
evangelion-unit-two t1_jcwltmt wrote
And what are they going to do, ban me from entering China? Thank god.
MysteryInc152 OP t1_jcputc0 wrote
Uses relative positional encoding. Long context in theory but because it was trained on 2048 tokens of context, performance gradually declines after that. Finetuning for more context wouldn't be impossible though.
You can run with FP-16 (13GB RAM), 8-bit(10GB) and 4-bit(6 GB) quantization.
Temporary-Warning-34 t1_jcpwx16 wrote
RP isn't forever, though.
MysteryInc152 OP t1_jcpxcn5 wrote
Oh for sure. Changed it to long context, i think that's better. I just meant there's no hard context limit.
Temporary-Warning-34 t1_jcpwu9p wrote
'Feedback bootstrap'. Lol.
Sorry. What does that mean?
MysteryInc152 OP t1_jcpzgd4 wrote
Bootstrapping is basically taking a model's best/better outputs on a certain task and finetuning on that.
EDIT: Seems I'm wrong on that
MisterManuscript t1_jcry6cj wrote
That's not what bootstrapping is, it is a resampling technique used to create multiple datasets of the same size from the original dataset using random sampling with replacement. It is done to get the estimate of the standard deviation of a desired variable.
Here's the link to the ISLR textbook. The bootstrap chapter will verify what it is.
MysteryInc152 OP t1_jcrz16i wrote
Yeah I'm wrong it seems. Read a few articles using bootstrapping in the definition I used so I assumed that was generally it.
relevantmeemayhere t1_jcrotun wrote
Mm, not really.
Bootstrapping is used to determine the standard error of estimates using resampling. From here we can derive tools like confidence intervals, or other interval estimates.
Generally speaking you do not use the bootstrap to tweak the parameters of your model. You use cross validation to do so.
[deleted] t1_jcrvumf wrote
[deleted]
relevantmeemayhere t1_jcrp2rr wrote
Honestly, really comes off as word salad lol.
I haven’t read the details, but it sounds like resampling in a serial learner?
visarga t1_jctfir1 wrote
Human Feedback is being boostsrapped by GPT3 predictions "stolen" against OpenAI's will (for just $500 API bills).
MisterManuscript t1_jcrwvc8 wrote
I tried googling it, it's is a nonexistent terminology in the realm of statistics. I know what bootstrapping is, but not this version of it.
It's better to ask the GitHub authors about this to make sure they're not just spitting out pseudostatistical terminology.
Addendum: another guy did query the authors regarding this terminology in the issues tab, they did not respond.
wyhauyeung1 t1_jcrvnxz wrote
I successfully deployed in my local PC and run. Just wondering, where is the model file stored after install? It seems I could not find any big files under the directory
emotionalfool123 t1_jcsmthb wrote
du -hs
for the rescue.
luaks1337 t1_jctagcz wrote
In German this command could be interpreted as "you son of a whore"
Tr4sHCr4fT t1_jctempd wrote
ncdu ftw
emotionalfool123 t1_jctg58u wrote
Thanks for letting me know a better way.
retrogod_thefirst t1_jcsjs3c wrote
!remindme 2 days
RemindMeBot t1_jcsjtqk wrote
I will be messaging you in 2 days on 2023-03-21 05:56:26 UTC to remind you of this link
2 OTHERS CLICKED THIS LINK to send a PM to also be reminded and to reduce spam.
^(Parent commenter can ) ^(delete this message to hide from others.)
^(Info) | ^(Custom) | ^(Your Reminders) | ^(Feedback) |
---|
BalorNG t1_jcqgc4x wrote
I has 6b parameters, but I bet it cannot answer what has happened on Tiananmen square in 1989 :3