• 0 Posts
  • 21 Comments
Joined 10 months ago
cake
Cake day: October 19th, 2023

help-circle
  • The concept as I understand it is that Threads has the sheer volume of content to completely drown out the existing Fedi content if it fully opens the floodgates. If that occurs and say 90% of content becomes Threads and then they start making Threads only extensions to Activity Pub, servers will have to start patching those in and the Activity Pub project is defacto owned by Meta.

    People also have issues with the Meta content moderation and the population on Threads, but as you noted that’s fixable on an individual and community level. The existential threat to the future of the Fediverse is why servers should defederate. Meta can’t and shouldn’t be trusted with any amount of power over this community project.














  • As I understand it, NAT is a firewall with only a very basic configuration: allow all outbound and accept only established inbound. If you don’t expect to have any incoming connections and completely trust all your internal devices then its good enough.

    However, if you start wanting to port forward for servers (SSH, FTP, video games) you need to poke holes in the NAT firewall and it has no additional configuration options to help you. The same goes for if you have internal (ex. IoT) devices that you don’t necessarily trust, there are no rules to block outbound traffic.



  • I wonder what the risks are to including deleted and pre-edited content in training data. Most of the edits are going to be typos and formatting, do you want 2-3 copies of the same message with typos in them for training data? Similarly, deleted comments are mostly nonsense, unhelpful, duplicate, or highly controversial things.

    If someone wants to dig through and find individual users to restore that’s one thing, but I don’t think I’d immediately choose to train off of that other data unless I had to.



  • We made a tag that can’t be reliably and deterministically scanned so we also included a machine learning model that takes a good guess at it.

    I just don’t see how you could possibly rely on a black box model for anything important. You have no way to mathematically prove if there are collisions in the model output or not, and newer versions of the model can’t be made backwards compatible. So if you have a database of thousands of these tags scanned, then they discover a critical vulnerability and provide a new model, you’re SOL and everything you have is worthless.