If you were writing a search engine through raw text, how would you structure it?

For example, would you remove words like "a" and "the" from the data or the searched string?

How would you deal with apostrophe S? like: cat's vs cats

What other constraints would you build in?
Anyone have an implementation of reading from the streaming #Pleroma public websocket? I can't seem to find one.

Ideally in python or go, but I'll take any language.

Have you lost any friends in today’s hyper-partisan, hyper-political social landscape?

(Please boost for more exposure)

Offer still stands. If you help me implement a follow request in #ActivityPub, I’ll Venmo or Paypay you $100.

Let me know if you wanna see my current code. https://dpaste.org/nNx9 = Follow code Pub/Priv certificates located here = https://dpaste.org/Qyhp Live non-functioning instance running at https://p.farhan.codes.

Anyone know #ActivityPub well and wanna make me $100 by helping me get a Follow request working?

I’m willing to pay by Venmo or Paypal.

I don’t understand HTTP signatures. Are the requests checked? If so, how does one view the certificate? Does each user have their own certificate? If so, how do I view them?

#Fediverse

Lets do a test…

Give me unique text from a random Post anywhere on the #Fediverse in the last 24 hours. As long as its not a locked down instance, I’ll see if I can tell you which account said it.

ie, If the message is “I am going to Florida on Tuesday to vote for the President”, give me “I am going to Florida”.

Really good podcast: LaYinka Sanni | Male insecurities & Muslim women in public | Blood Brothers #48 https://www.youtube.com/watch?v=LqYode0UfUA

What language would you use to write a web app that populate a template with JSON data (AJAX) derived from database query results?

Software design question

I am writing some code that will poll the API of an arbitrarily high number of sites (starting at around 50 but it certainly will go up) once every 1-2 seconds and logging retrieved information to a database.

The sites report highly overlapping data, so there must be a process to prevent duplicates.

What is the best way to structure this?

My off-the-cuff approach is to have each website poll run as a separate thread and send the retrieved data to a database writing thread. The database writing thread will do all duplicate verification. Obviously it is going to be very busy and will likely saturate its connection quite quickly.

I also thought of having the polling threads connect directly to the database - but then the database would have multiple connections and potentially race conditions.

My current working idea is to have the database thread split into 2 parts: A. The main thread process the retrieved information to verify if it’s a duplicate or not (lots of SELECT operations) If new… B. Send the result to a writing thread.

This could potentially also create race-conditions, no?

Is there a more efficient way to do this?

Anyone have the #ActivityPub json specificiation/definition of a public timeline? Looking for the data types of each field.

#ArabicWords - To know

عَلِمَ (‘alima) - “He knew” يعْلَمُ (ya’lamu) - “He knows” عِلْمٌ (‘ilm) - “Knowledge” إعْلَمْ (‘i’lam) - “Know [imperitive]”

Other meanings: مُعَلِّمٌ (mu’allim) - Teacher عالِمٌ (‘aalim) - Scholar عَلَّمَ (‘allama) - To impress or deeply teach عَلامَةٌ (‘alaamah) - Erudite scholar

Example: عَلَّمَ الْإِنسَانَ مَا لَمْ يَعْلَمْ “He taught (impressed) man what he did not know” Ref. Qur’an 96:5 اعْلَمُواْ أَنَّ اللّهَ شَدِيدُ الْعِقَابِ وَأَنَّ اللّهَ غَفُورٌ رَّحِيمٌ “Know that God is severe in punishment and that God is Forgiving, Merciful” Ref. Qur’an 5:98

What’s a good way to popularize the account @ArabicWords ? Its basically me doing the past, present and gerund form of a word, alternate meanings and examples.

#ArabicWords - To cover

غشى (ghashaa) - "He covered"
يغشى (yaghshaa) - "He covers"
غشا (ghashaa) - "Covering"

Other meanings:
To wrap up
To be dark
A covering

Example:
وَاللَّيْلِ إِذَا يَغْشَى - "By the night as it veils [in darkness]" Ref. Qur'an 92:1
وَعَلَى أَبْصَارِهِمْ غِشَاوَةٌ "...and upon their vision a covering" Ref Qur'an 2:7

#ArabicWords - To say

قال (qaala) - “He said” يقول (yaqoolu) - “He says” قول (qawl) - “A saying” قيل (qeela) - “It was said [passive]” يُقالُ (yuqeela) - “It is said [passive]” قلْ (qul) - “Say! [imperitive]”

Example: قُلْ هُوَ اللَّهُ أَحَدٌ - “Say: He is God, One” Ref. Qur’an 112:1

#ArabicWords - To disperse

شتت (shatta) - "He dispersed"
يشتت (yashatatu) - "He disperses"
أشتاتٌ (ashtaatun) - "Dispersed"

Other meanings:
Dispersed
Separated
Scattered
Dissolved

Example:
يَوْمَئِذٍ يَصْدُرُ النَّاسُ أَشْتَاتًا - "On that day, people will come forward separated [from one another]" Ref. Qur'an 99:6
#ArabicWords - To see

رأى (ra'a) - He saw
يرى (yaraa) - He sees
رؤية (ru'yatu) - A sighting

Examples:

يَا بُنَيَّ إِنِّي **أَرَىٰ** فِي الْمَنَامِ أَنِّي أَذْبَحُكَ فَانظُرْ مَاذَا تَرَىٰ
"Oh my Son, I am seeing in dreams that I am sacrificing you. So tell me what you think” Ref. Qur’an 37:102

Other meanings:
Vision
To think
To judge

Note to Arabic students: This verb conjugates according to the معتل rules
Show older
Mastodon for Tech Folks

This Mastodon instance is for people interested in technology. Discussions aren't limited to technology, because tech folks shouldn't be limited to technology either!