Ekky

Ekky@sopuli.xyz · edited 13 days ago

Can’t those both be true at the same time?

The “system” is working as intended by the rich elite/<insert antagonist>, which means that it’s fundamentally broken for the general populace, and therefore must be fixed, which is easiest done by first destroying and then rebuilding it?

Seems like an oxymoron to me, but I’m not entirely sure of the context.

Ekky@sopuli.xyz · 29 days ago

You thought journalism had reached rock bottom already? Watch this:

Ekky@sopuli.xyz · edited 2 months ago

I’m gonna be honest: I’ve been skimping on anti-malware since I moved to Linux.

I’m still keeping up the common-sense part, i.e. being careful about running code I don’t know, and running untrusted code and weird URLs in a virtual environment (well, except for the AUR perhaps), but I only scan for malware once or twice a year, if at all.

Actually, I just did a scan with RKHunter, which came back clean except for the usual false positives. I find that mildly suspicious, as one would imagine there to be some malware with all the small-time programmers and script kiddies in the Linux community.

What are you using as anti-malware? Does anyone know of good set-and-forget methods, or some good GUIs for easy containment management, scanning, and whitelisting? It can’t be that ClamAV, RKHunter, and chkrootkit are the only halfway decent AVs out there.

Ekky@sopuli.xyz · 2 months ago

Huh? That’s quite interesting.

I’ve been running a hacked-together script which uses a disembodied copy of Proton 8 (i.e. copied to a portable drive; it doesn’t need Steam installed to run) to launch my games from Itch and GOG.

Hmm, just tried to use Proton 9.0-2 and the current experimental in my steamapps (which appears to be version 9.0-202), and it works just fine. Though, I guess Lutris’ implementations are quite a bit more advanced than my hacks (no debugging let’s goooo).

A very simplified version of my script, for those who might be interested: pastebin.com/kbNNvzAx. Don’t forget to uncomment game_exe and set it to your executable - won’t work otherwise.
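For anyone who would rather not click through, here is a rough Python sketch of the general idea. It is not the linked script, just a minimal illustration under a few assumptions: Proton's launcher is the `proton` file inside the copied Proton folder, it is invoked as `proton run <exe>`, and it reads the `STEAM_COMPAT_DATA_PATH` and `STEAM_COMPAT_CLIENT_INSTALL_PATH` environment variables. The function name and all paths are made up for the example, and pointing `STEAM_COMPAT_CLIENT_INSTALL_PATH` at the Proton folder itself is a guess at what is sufficient when Steam is not installed.

```python
import os
from pathlib import Path


def build_proton_launch(proton_dir, game_exe, prefix_dir):
    """Return (command, env) for starting a Windows executable with a standalone Proton copy.

    Hypothetical helper: it only builds the command and environment, it does not run anything.
    """
    proton_dir = Path(proton_dir)
    env = dict(os.environ)
    # Directory in which Proton keeps the Wine prefix for this game.
    env["STEAM_COMPAT_DATA_PATH"] = str(prefix_dir)
    # Assumption: Proton wants this set even without Steam, and any
    # existing directory is good enough for a portable setup.
    env["STEAM_COMPAT_CLIENT_INSTALL_PATH"] = str(proton_dir)
    command = [str(proton_dir / "proton"), "run", str(game_exe)]
    return command, env


# Example use (paths are placeholders, the prefix directory must already exist):
#   import subprocess
#   cmd, env = build_proton_launch("/mnt/usb/Proton 8.0", "/games/foo/game.exe", "/games/foo/pfx")
#   subprocess.run(cmd, env=env, cwd="/games/foo", check=True)
```

Keeping one prefix directory per game means a broken prefix can be deleted without touching the others.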

Also, pinging @DacoTaco@lemmy.world in case of interest.

Ekky@sopuli.xyz · 2 months ago

I guess you could also ask: “Does the pro-tier give one any options/additional functionality that the non-pro/non-donation tier doesn’t?”

Obviously, if you have to pay for additional functionality (like settings/themes/updates), then it isn’t simply a request for donations. Though I’d argue for ignoring trivialities such as “thank you” emails and possibly a small visual-only token in the program showing that you paid/donated, as those barely count as “functionality”.

Ekky@sopuli.xyz · edited 2 months ago

~~“Batteries” is a rather broad category.

Are we talking hydroelectric batteries? Other potential or kinetic batteries? Chemical batteries (and which subcategory)? Or maybe hydrogen-based power storage?

Since there’s a dam on the list, I’d imagine “batteries” to be electrolytic power stores or hydrogen fuel cells, but the visualization remains lazy and perhaps borderline misinformative (depending on how nit-picky you are).

EDIT: The illustration might also use a simplified definition of a battery (to store, excluding conversion between kinds of power) instead of the different battery technologies which exist or the full definition, which could have one argue that batteries aren’t renewable by definition.

Though, that might be reading too much into it.~~

Actually, never mind, I’m probably too tired to go out on an adventure about the technicalities of the definition of “battery” to make any real amount of sense and not fall into edge cases.

I also misread “energy source” as “renewable”…

Ekky@sopuli.xyz · 3 months ago

You don’t have to sanitize the weights, you have to sanitize the data you use to get the weights. Those are two very different things, and while I agree that sanitizing an LLM after training is close to impossible, sanitizing the data you give it is much, much easier.

Ekky@sopuli.xyz · 3 months ago

Oh no, it’s very difficult, especially on the scale of LLMs.

That said, the rest of us (those of us who have any amount of respect for ourselves, our craft, and our fellow humans) have been sourcing our data carefully since way before NNs, such as by asking the relevant authority for it (e.g. asking the post office for images of handwritten destinations).

Is this slow and cumbersome? Oh yes. But it delays the need for over-restrictive laws, just like with RC crafts before drones. And by extension, it allows those who could not source the material they needed through conventional means, or those small new startups with no idea what they were doing, to skim the gray border and still get a small and hopefully usable dataset.

And now, someone had the grand idea to not only scour and scavenge the whole internet with reckless abandon, but also boast about it. So now everyone gets punished.

Lastly: don’t get me wrong, laws are good (duh), but less restrictive or incomplete laws can be nice as long as everyone respects each other. I’m excited to see what the future brings in this regard, but I hate the idea that those who caused this change are likely the only ones who will go free.

Ekky@sopuli.xyz · 3 months ago

So now LLM makers actually have to sanitize their datasets? The horror…

Ekky@sopuli.xyz · edited 4 months ago

The absolute ridicule! I’m sorry, but I might not survive this! How could this come to be?!

Ekky@sopuli.xyz · 4 months ago

Nuh-uh, I saw a Steam survey that said that less than two percent of computers use Linux!

What do you mean by “the headless internet backbone servers, Android phones, and smart appliances don’t have Steam”?

Ekky@sopuli.xyz · 5 months ago

Question marks are overrated, so are commas and periods And now that we are at it mst ppl cn ndrstnd wrttn txt jst fn wtht wvls s lts jst drp thm t

prd

Ekky@sopuli.xyz · 5 months ago

Thank you for the explanation, though the underlying requirements for keeping a list locally appear to remain much the same, since you really only need to add a few trigger words to the “dumb, always-on” local parser (such as your top 1000 advertisers’ company or product names). After all, I’d imagine we do not require context, but only really need to know whether a word was said or not, not unlike listening for the “real” trigger word.
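To make the "no context needed" point concrete, here is a toy Python sketch. It is only an illustration under loud assumptions: a real always-on detector would match audio patterns rather than text, so the input here is an already-transcribed string, and `spot_keywords` and the example watchlist are invented for this sketch.

```python
import re


def spot_keywords(transcript, watchlist):
    """Return the watchlist words that occur in the transcript, ignoring all context.

    `watchlist` is a set of lowercase words (hypothetical advertiser or product names).
    """
    # Split into lowercase word tokens; nothing about order or meaning is kept.
    words = set(re.findall(r"[a-z0-9']+", transcript.lower()))
    # Plain set intersection: we only learn whether a word was said or not.
    return words & watchlist


# Example use:
#   spot_keywords("I could really go for a Cola right now", {"cola", "pizza"})
```

The lookup is a set intersection, so growing the watchlist from one trigger word to a thousand barely changes the cost of the check itself.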

This is of course only one of many ways to attack such a problem, and I do not know how they would ultimately do it, assuming that they were interested in listening in on their users in the first place.

And yes, embedded devices are slightly harder to fiddle with than using your own computer, but I’d bet that they didn’t actually take the time to make a proper gate array and instead just use some barebones Linux, which most likely means UART access!

Ekky@sopuli.xyz · 5 months ago

Assuming that they parse everything locally, which appears to be the case, then why would it have to send a constant stream of audio? A small list/packet of keywords of a few bytes or KB once a day would suffice for most telemetry needs (including ad analysis and other possible spying purposes).
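As a back-of-the-envelope check on the size claim, here is a small Python sketch. It assumes a hypothetical fixed watchlist of 1000 keywords and a made-up encoding (one bit per keyword, set if the word was heard that day); nothing here reflects how any actual device reports data.

```python
def pack_hits(hit_indices, watchlist_size):
    """Pack the indices of keywords that were heard into a bitmap, one bit per keyword."""
    bitmap = bytearray((watchlist_size + 7) // 8)  # round up to whole bytes
    for i in hit_indices:
        if not 0 <= i < watchlist_size:
            raise ValueError(f"keyword index {i} is outside the watchlist")
        bitmap[i // 8] |= 1 << (i % 8)
    return bytes(bitmap)


# Example use: keywords 3, 42 and 999 of a 1000-word list were heard today.
#   payload = pack_hits({3, 42, 999}, 1000)
#   len(payload)  # 125 bytes for the whole day
```

Under these assumptions a full day's report fits in 125 bytes, which would disappear into ordinary telemetry traffic.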

Also, one ought to be able to see the contents of the packets if they retrieve the device’s SSL key for the session, so this should also be falsifiable.

Ekky@sopuli.xyz · 6 months ago

Didn’t Windows 10 also have ads in the start menu from pretty much the start, like Candy Crush and such? Or maybe I just used a bloated OS image, I wouldn’t put it past me.

Ekky@sopuli.xyz · 6 months ago

Isn’t that a good thing in this case? Once someone invents a new innovative slur you can just go to ud and search it up, instead of making assumptions or having to ask them yourself.

Ekky@sopuli.xyz · 6 months ago

Rendering issues are just a minor inconvenience, the real horror starts when your chunk is offloaded and you get sent to the void.

Ekky@sopuli.xyz · 7 months ago

And then links to a similar sounding but ultimately totally unrelated site.

Ekky@sopuli.xyz · 7 months ago

I don’t usually lick the ice buildup off the back of the fridge, but I’ll keep it in mind if I ever choose to.

Ekky@sopuli.xyz · 9 months ago

It appears that, with the increase in popularity of machine learning, the percentage of people who properly source and sanitize their training data has steeply decreased.

As you stated, an MLAI can only be as good as the data it was trained on, and is usually way worse. The popularity and application of MLAIs built with questionable practices scare me, though at least their fuckups will keep me employed and likely busier than ever.