• 41 Posts
  • 222 Comments
Joined 1 year ago
cake
Cake day: June 9th, 2023

help-circle


  • You’ve got to do some manual config. I know about it but don’t use it. You can redirect home folders with the container in the distrobox create flags. I think the better option is to use the user/groups/SELinux context in addition to the container as this will show up in ownership and is more easy to trace. One of my main problems is how packages have Python installation requirements that by default try to break pip out of any containerized context and create their own venv setup. It totally screws up the whole distrobox container setup and separation from the base system.


  • With Linux over the years, I have learned to ignore all hardware marketing as (basically) scammers. The supporting software is the important part. If the software is not open source, the product is only available to rent and likely includes or has the potential to become an extortion scam of subscription parasites. When I shop for products now, I do so by searching for the open source software first. Once I find a large project with several contributors, I git clone the repo and then I run an app called gource on the command line. Gource creates a 3d visualization of the project over time and its commit history. Have a look at the Linux kernel some time or just watch a video of someone that has uploaded the visualization: https://www.youtube.com/watch?v=5iFnzr73XXk

    With the actual visualization, you can zoom in and select the individuals or watch branches specifically. The trick is to get an idea of who the main contributors are in the various spaces and how consistent they are. Find who is working on what hardware and how they are working on it. Some times you’ll see a person comes in and only makes a single commit or a few that contain everything for a device and then they disappear. These are often subcontracted devs that a company hires and gives a checklist. Issues, bugs, and unsupported features are unlikely to get fixed unless you see someone else that is making commits in this space. What you’re really looking for is one of the main project devs that makes ongoing commits to some specific hardware over longer amounts of time and fairly recently. It means they have the device in question. That generally means the device has or will have excellent support in the long term. It also generally means the person either really liked the product or the company is smart enough to supply the dev with the device or supporting documentation.

    Sorry if this seems unsolicited. It took me a long time to break out of the hardware spec shopping fallacy and all of the troubles it can cause. Prioritizing true ownership and shopping for the software first is a far more enjoyable life experience. It likely won’t help in this niche, but for computers in general use: https://linux-hardware.org/

    You will likely find that search engines attempt to obfuscate this information. Expect that. Use offline open source LLM’s, ask the community, or more advance searching methods to find relevant info. Both m$ and the goo are the two biggest beneficiaries of the proprietary software ecosystem and they are the only two web crawlers that exist at relevant scale. All search engines use one or both of these sources either directly or by proxy.


  • TBH: tl;dr (…but read ~1/4 and skimmed the rest.)

    Emacs can likely do most, if not all, of what you’re looking for.

    As far as distros, go with either Fedora Workstation or Silverblue. If you can run SB, try to avoid messing with the base system as much as possible, skip using the toolbox containers system and just use distrobox. With distrobox, you have almost all Linux distros available as containers, so you build on them. The only exception I know of is NIX. You can’t run NIX in distrobox. You probably could run the NIX package manager, but that involves this weird setup where a user owned directory exists in / root. Personally, this is just too weird for me to use it. I expect all user activity and configuration files to be confined to /home/$USER/

    Fedora just works, but try and lag behind the release cycle a little bit. Like right now F40 is pretty solid, but there were some issues in the first month or so after F40 first came out. I have lagged in every release since ~F28 and never had issues. I switched to F40 within the first week or so and a few packages were wonky. Basically Python was super fresh and did some odd stuff with containers where it did not work without manually removing and replacing Python in each container. I think that was the only manual intervention issue I’ve had with Fedora. I have a 3080Ti laptop with the 16 GB GPU. The Anaconda system in Fedora builds the Nvidia kernel module automatically in the background each time the kernel is updated. It works flawlessly, even with secure boot enabled.










  • Yes .docx.

    It appears as though the encoding is missing in such a way that nothing in Linux recognizes the file. The underlying CLI tools don’t have a way of converting the file. I tried with Python’s docx tool and with iconv. It has to be encoding related because some tools initially load the file with several sets of Asian characters instead of English. However, there is no hexadecimal or sections of entirely binary looking data. Archiving tools do not open up the the file to reveal anything else like a metafile or header. Neo vim shows garbled nonsense throughout. Bat warns of binary. Python won’t load the file, nor will Only Office. Libre Office and Abi Word load initially with Asian characters before crashing.

    The only option is likely gong to be setting up the W10 machine and converting a bunch of files within it.

    Ultimately, my old man thinks he can be an author all of the sudden and is trying to write. He’s not very capable of learning. I’m not confident that he can learn to use FOSS to do the same thing he has been doing. This post was just to see if there are options I am not already aware of that might actually work in practice. I can easily do everything I need in FOSS. I can do everything he needs to do. I’m more concerned about becoming his tech support when he forgets how to copy pasta. He already fails to separate the internet hardware connectivity from the web browser and operating system within his mental model of technology.










  • j4k3@lemmy.worldtoLinux@lemmy.ml...
    link
    fedilink
    English
    arrow-up
    2
    ·
    1 month ago

    Distilbert https://huggingface.co/distilbert/distilroberta-base

    …was setup for something like that here but note that the repo that runs this has an “unsafe” warning that I have not looked into: https://huggingface.co/spaces/nasrin2023ripa/multilabel-book-genre-classifier

    https://huggingface.co/spaces/nasrin2023ripa/multilabel-book-genre-classifier/tree/main

    It might be fine or whatnot, I’m on mobile and can’t see the file in question. The associated Python code might be a helpful starting point.

    In my experience, most models intentionally obfuscate copyright sources. They all know the materials to various degrees, but they are not intended to replicate sources. They all have strong interference in place to obscure both their recognition and reproduction potential. If, for instance, you can identify where errors are inserted and make a few corrections, they often continue adding a few details that are from the original source. If this is done a few times in a row, they tend to gain more freedom before reverting to obfuscation again. This is the behavior I look out for. It is a strong tool too if you get creative in application.

    Perhaps someone posts an API to look up the library of congress classification of a work based on a few lines or something. GL