help-circle
rss
Why don't we see more distributed large language model projects?
I was wondering this after seeing the following ["Run 100B+ language models at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading" - this approach might be useful for SD-type models too Discussion](https://github.com/bigscience-workshop/petals) > Worthy of note... If the model is split up layerwise (it seems there are 80 layers), each layer is only 2.5GB of weights. > > Not much communication happens between layers at inference time - just a couple of kilobytes of activations per token (~word) of output generated. > > That means it's theoretically possible to split the work between machines across the internet, and 80 volunteers could run this on regular machines to get the whole thing running. > > — By [u/londons_explorer](https://reddit.com/user/londons_explorer) on [[D] Has anyone got YaLM-100B to run?](https://reddit.com/r/MachineLearning/comments/vpn0r1/d_has_anyone_got_yalm100b_to_run/)
fedilink

OsmAnd version 4.3 mapping app for Android released
Much faster OpenGL map tile rendering and other cool stuff.
fedilink

Chatbot web app + HTTP and Websocket endpoints for BLOOM-176B inference with the Petals client
[Repo](https://github.com/borzunov/chat.petals.ml) [Petals](https://github.com/bigscience-workshop/petals)
fedilink

The hardest scaling issue by Codeberg (a nonprofit, free software platform/service for code hosting)
cross-posted from c/softwareengineering@group.lt: https://group.lt/post/44632 > This kind of scaling issue is new to Codeberg (a nonprofit free software project), but not to the world. All projects on earth likely went through this at a certain point or will experience it in the future. > > When people like me talk about scaling... It's about increasing computing power, distributed storage, replicated databases and so on. There are all kinds of technology available to solve scaling issues. So why, damn, is Codeberg still having performance issues from time to time? > > ...we face the "worst" kind of scaling issue in my perception. That is, if you don't see it coming (e.g. because the software gets slower day by day, or because you see how the storage pool fill up). Instead, it appears out of the blue. > > **The hardest scaling issue is: scaling human power.** > > Configuration, Investigation, Maintenance, User Support, Communication – all require some effort, and it's not easy to automate. In many cases, automation would consume even more human resources to set up than we have. > > There are no paid night shifts, not even payment at all. Still, people have become used to the always-available guarantees, and demand the same from us: Occasional slowness in the evening of the CET timezone? Unbearable! > >I do understand the demand. We definitely aim for a better service than we sometimes provide. However, sometimes, the frustration of angry social-media-guys carries me away... > > two primary blockers that prevent scaling human resources. The first one is: trust. Because we can't yet afford hiring employees that work on tasks for a defined amount of time, work naturally has to be distributed over many volunteers with limited time commitment... second problem is a in part technical. Unlike major players, which have nearly unlimited resources available to meet high demand, scaling Codeberg's systems... TLDR: sustainability issues for scaling because Codeberg is a nonprofit with much limited resources, mainly human resources, in face of high demand. Non-paid volunteers do all the work. So needs more people working as volunteers, and needs more money.
fedilink


Marble - find your way and explore the world
Available for Android, Windows, Mac OS X, Linux
fedilink

BSD Weekly - Issue 137
News 490: New Year’s Plan9’ing: FreeBSD Foundation’s Software Development review of 2022, what can we learn from Vintage Computing, OpenBSD KDE Status Report 2022, a Decade of HardenedBSD, In Praise of Plan9, and more. Tutorials Automation and Hacking Your FreeBSD CLI: The more we automate, the more we can save time and boost our productivity to higher levels. In this article we plan on looking at how to build more human-readable outputs in FreeBSD by leveraging tools like libxo to drive better output for automation APIs. OpenZFS – Data Security vs. Integrity: Data security is about preventing data from being disclosed, ensuring that only the correct people can access it. Data integrity ensures the data is correct, that it has not become corrupt due to hardware failure or other issues. With ZFS, you can get both.
fedilink

AWESOME! pen source karaoke singing game UltraStar Deluxe, inspired by Sony SingStar™ (includes step by step guide to get it working)
tldr guide to get it working: ## Step 1: building you need to build the project (no packages for instance for fedora), but it is easy since build requirements are up to date. However, i had to use the current master version, since the stable release from 2020 wants a version of ffmpeg which is too old for my fedora. received some odd warnings, but the game still works ``` Linking ../game/ultrastardx Warning: "crtbegin.o" not found, this will probably cause a linking failure Warning: "crtend.o" not found, this will probably cause a linking failure 216888 lines compiled, 10.6 sec 59 warning(s) issued 616 note(s) issued make[1]: Leaving directory '/home/username/src/USDX/src' ``` ## Step 2: starting creative commons songs For legal reasons, it doesn't contain for instance copyrighted music and similar. some creative common songs which you can use without much configuration: https://github.com/UltraStar-Deluxe/songs This is great to understand how real examples work and makes step 3 (the interesting part) easier. You need to git clone this repository and then start the game with the path to this repo. ``` $ cd ~/src/USDX/game ~/src/USDX/game $ ./ultrastardx -Songpath ~/songs ``` ## Step 3: starting with copyrighted music Basically, ultrastar just loads whatever music file you dropped in the songs folder. So this way, you can also decide if you want to sing based on an instrumential version or if you want to sing along the original song, with the original singer assisting you. For the text and the notes how high you need to sing, i found this repository which contains both for many popular songs: https://github.com/razzertronic/usdx-songs but it doesn't work yet, the music file is still missing. Look into the creative commons repository and see which folder structure of the songs are expected (cover, text file and music file is important). The music file is also referenced in the song.txt file. This file looks for instance like that. `audio.ogg` must exist. MP3 is just the tag name, so the format doesn't matter. the creative commons repositiory also uses some OGG files. ``` #TITLE:Alle nur geklaut #ARTIST:Die Prinzen #MP3:audio.ogg #BPM:159,75 #GAP:12480 : 0 2 64 Ich ``` Also, i found that the music file i downloaded from youtube and the song.txt don't fit perfectly together, the youtube song lags a few seconds behind. TO fix that, i needed to change the GAP value in the song.txt Another example is: in one Madonna song, i found that one break (short period of silence) was inconsistent to the song.txt. Anyway, i guess you need to be prepared to spend some time to get each song working. Didn't try it yet, but maybe https://github.com/UltraStar-Deluxe/UltraStar-Creator can help with that. However, the software itself, and the overall result is **awesome** and very much on the same level as singstar!
fedilink



Stable Diffusion integration for Krita paint
This looks very interesting for integrating AI painting into an Artist's toolbox.
fedilink


An introduction to open-source games (crosspost for Lemmy)
Here's a blog post where I talk about some well-known open-source games: [libranet.de/display/0b6b25a8-7…](https://libranet.de/display/0b6b25a8-7063-cc8c-7f9f-116272113110) I was originally going to post it directly to this group, but for some reason whenever I tried it the message got stuck "pending" and removing the mention fixed it.
fedilink


Alternative to stremio + torrentio?
For some reason this has stopped working for me. Is there any alternative for streaming torrents with the discovery that stremio offers? I know I can use jellyfin + ombi + prowlarr + radarr, but installing and configuring it isn't as simple as with stremio. Something like this using Jellyfin would be nice: https://github.com/klutchell/mediaserver
fedilink



Seeking Volunteers for Landlord Accountability Project
PS: I'm not the OP. Just passing this along.
fedilink

Orange Pi Maker Plans to Release an Arch-based Linux Distro Soon
Where was this announced? When they release it, I want to know.
fedilink


All about open source! Feel free to ask questions, and share news, and interesting stuff!

Useful Links

Rules

  • Posts must be relevant to the open source ideology
  • No NSFW content
  • No hate speech, bigotry, etc

Related Communities

Community icon from opensource.org, but we are not affiliated with them.

  • 0 users online
  • 18 users / day
  • 30 users / week
  • 73 users / month
  • 324 users / 6 months
  • 8 subscribers
  • 980 Posts
  • 3.39K Comments
  • Modlog
Lemmy
A community of privacy and FOSS enthusiasts, run by Lemmy’s developers

What is Lemmy.ml

Rules

  1. No bigotry - including racism, sexism, ableism, homophobia, transphobia, or xenophobia. Code of Conduct.
  2. Be respectful. Everyone should feel welcome here.
  3. No porn.
  4. No Ads / Spamming.

Feel free to ask questions over in: