Secure Coms · 19 days ago

Huge Computer Assisted Codebook

gitlab.com

Huge Computer Assisted Codebook

gitlab.com

cm0002@no.lastname.nz to

Secure Coms · 19 days ago

Here ForAwhile / ed · GitLab

gitlab.com

Turn Words Into Numbers

Chat

jasory
link
fedilink
arrow-up
1·
18 days ago
“Making frequency analysis ineffective”

Oh boy, let’s hope nobody uses it for large plain texts. If x maps to k1,K2,… then one simply needs enough instances of x to reconstruct the key. It must at the very minimum need multiple symbols to map to the same strings to achieve ambiguity.

The cryptographic claims seem laughable.
- hereforawhile
  link
  fedilink
  arrow-up
  1·
  18 days ago
  
  It must at the very minimum need multiple symbols to map to the same strings to achieve ambiguity.
  
  It does this.
  
  The only conventional cryptography is the shuffle function which takes entropy from the OS.
  - jasory
    link
    fedilink
    arrow-up
    1·
    17 days ago
    What motivated you to write this program?
    
    Your choice of “codebook”, is an immediate red flag and reeks of pop-crypto. There is a reason why this approach was abandoned some 100+ years ago, even properly implemented they have severe shortcomings.
    - hereforawhile
      link
      fedilink
      arrow-up
      1·
      17 days ago
      
      What motivated you to write this program?
      
      Just for fun basically.
      
      I’ve had the idea for awhile but the problem is was always a huge amount of grunt work to get the initial database created. With the use of LLM I basically mined all the unique entries, common phrases.
      
      I’m not claiming it’s the best or anything at all. But for codebook standards…I tried to implement all the things that would make a good code book.
      
      Ability to say the same thing over and over and make it look different for mitigation against frequency analysis.
      
      Easy, secure, shuffling
      
      customizable
      
      Assisted composing
      
      Exportable
      
      Long term rotating key schema
      
      Conclusive and established database
      
      Portable
      - jasory
        link
        fedilink
        arrow-up
        1·
        14 days ago
        Why did you use an LLM for the frequency tables? The “most common words used” is very useful data and as such there are many already existing compilations, used by things like spell checkers. The Linux system dictionaries are one example.
        
        The fact that you completely ignore that simply using a larger RSA key would both be faster and more secure than your approach, doesn’t inspire confidence either.
        
        (It’s also in python which is basically unusable. )
        
        hereforawhile
        link
        fedilink
        arrow-up
        1·
        12 days ago
        I used a LLM to create my database because it is not only a collection of words, but common phrases. Plus not only can the LLM format the database how I want it so it’s interpretable to the program, it can build the database and included all the appropriate amount of duplicates.
        
        The fact that you completely ignore that simply using a larger RSA key would both be faster and more secure than your approach, doesn’t inspire confidence either.
        
        The goal was to not use any modern crypto… Codebooks have been used for a very long time and are secure with proper key management.
        
        This is an attempt at a modern codebook. It tackles most all of the shortcomings of previous iterations.
        
        (It’s also in python which is basically unusable. )
        
        Haha.
        
        jasory
        link
        fedilink
        arrow-up
        1·
        10 days ago
        “but common phrases”. These also exist, they are used in grammar checkers. They also exist in texts for English learners.
        
        Datasets like these are very easy to come by. In fact you could actually write a program that set up a Markov matrix of pairs of words for any input text, and use it to determine common phrases. This is the standard sloppy approach, a more clever one would restrict the pairing to grammatically valid ones.
        
        hereforawhile
        link
        fedilink
        arrow-up
        1·
        9 days ago
        I mean what’s the real point you are arguing? I’m happy to include other datasets in the master database. A bigger database is no problem for this schema or SQLite limitations.
        
        The LLM produced all these things with one or two prompts and they are all grammatically valid… It’s just what I happened to source the initial data set from.
        
        jasory
        link
        fedilink
        arrow-up
        1·
        8 days ago
        My point is that your approach is awful. It’s like you completely fumbled into your idea, and you’re trying to sell it as superior to rigorously constructed cryptosystems ( nearly all exploits are due to developer incompetence not cryptographers).
        
        “They are all grammatically valid”- yeah you have no idea what I just said. I was talking about constructing a probability matrix from a language, if you restrict the entries to grammatically valid pairs/tuples it reduces the size and is therefore easier to compute. Whether or not your ciphertext is grammatically valid English has zero effect on its strength.
        
        The reason why you might want to take the approach I described is that you can make precise claims about the dataset and final result. Rather than saying “umm … Chatgpt said so…”.
        
        Regardless, this has nothing to do with cryptographic security. It’s just an immediate red flag when developers miss obvious solutions.

You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: [email protected]

This is a community for enthusiest who love to ponder new ways for Alice to communicate with Bob in a world where global passive adversarys probably record every bit that ever crosses the wire.

Discuss cryptography, secure key exchange, private messangers, radios, encoding, networking tools, authentication mechanisms and anything relevant to coming up for ways to Alice to get a message to Bob.

Visibility: Public

This community can be federated to other instances and be posted/commented in by their users.

1 user / day
4 users / week
18 users / month
123 users / 6 months
9 local subscribers
87 subscribers
22 Posts
27 Comments
Modlog

mods:
cm0002