| Literature DB >> 35161859 |
Humaira Arshad1, Saima Abdullah1, Moatsum Alawida2, Abdulatif Alabdulatif3, Oludare Isaac Abiodun4, Omer Riaz5.
Abstract
Currently, law enforcement and legal consultants are heavily utilizing social media platforms to easily access data associated with the preparators of illegitimate events. However, accessing this publicly available information for legal use is technically challenging and legally intricate due to heterogeneous and unstructured data and privacy laws, thus generating massive workloads of cognitively demanding cases for investigators. Therefore, it is critical to develop solutions and tools that can assist investigators in their work and decision making. Automating digital forensics is not exclusively a technical problem; the technical issues are always coupled with privacy and legal matters. Here, we introduce a multi-layer automation approach that addresses the automation issues from collection to evidence analysis in online social network forensics. Finally, we propose a set of analysis operators based on domain correlations. These operators can be embedded in software tools to help the investigators draw realistic conclusions. These operators are implemented using Twitter ontology and tested through a case study. This study describes a proof-of-concept approach for forensic automation on online social networks.Entities:
Keywords: automation tools; evidence analysis; experimental visualization; forensic applications; forensic automation; semantic data presentation; social network forensics
Mesh:
Year: 2022 PMID: 35161859 PMCID: PMC8839830 DOI: 10.3390/s22031115
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1The multi-layered conceptual model.
Figure 2The architecture of the multi-layered model.
Figure 3Implementation details of the multi-layered model.
Figure 4Raw data downloaded by Twitter scraper.
PHP functions and their applications.
| PHP Functions | Applications |
|---|---|
| getmxrr ($hostname, | Obtain the names of mail exchanger hosts for a specific host. |
| gethostbyaddr ($ip) | Collect the hostname associated with an IP address. |
| gethostbynam e($name) | Gather the IP address associated with a hostname. |
| checkdnsrr ($host, $type) | Checks the DNS for records of type $type for host $host and returns Boolean true if any are found. |
| dns_get_record | Retrieves the DNS record for host |
| getservbyname | Obtains the port number for the service $service via the protocol $protocol. |
Figure 5PHP codes for Facebook API data retrieval.
Figure 6PHP codes for Twitter API data retrieval.
Turtle serialized data representation of USER and TWEET construct in the data model and RDF stores.
| User Concept on Twitter. | An Individual of Class Tweet. |
|---|---|
|
| #Tweet |
SPARQL queries to extract data from RDF stores.
| SELECT ? time_stamps | SELECT ? time_stamps |
| Query (A). Timestamps for all the objects | Query (B). Timestamps for all the Tweets Created by a user “Alice.” |
List of Analysis Operators.
| Operators Name | Description | |
|---|---|---|
| 1 | Interaction Graph | This operator uses subject and object correlations. |
| 2. | Interaction Frequency Analysis | This function is based on subject and object correlations. It is used to perform a frequency analysis of communications among two users to sort and filter communication among users. It helps identify the dynamics of their relationships. |
| 3. | Temporal Activity Graph | This function uses temporal correlations, as explained in the section. It is used to analyze a user’s activity pattern in a specific period. |
| 4. | Geo-location Activity Graph | This operator uses object correlations and helps sort the locations that are tagged in online content. |
| 5. | Hashtag Cloud | This function is based on object correlations and is designed to give a quick overview of the hashtags used in tweets. |
| 6. | Tweet Cloud | This method is also based on object correlations and is designed to give a quick overview of the topics or themes existing in someone’s tweets. |
| 7. | Similarity of Views | This operator is based on rule-based correlations and identifies the nearness of opinion among two users. |
| 8. | Trace Operator | Trace is an operator that links the evidence to the entity. |
SPARQL queries are used to retrieve data from RDF data stores.
|
|
|
|
| Select ?loc ?placename ?lat ?longt ?time |
| Result: A List of retweets and their ids tweeted by Subject(A) and retweeted by Subject (B). | Result: A list of Place names, longitude & latitude values. |
Figure 7Interaction graph (a) from the subject to her contacts; (b) from contacts to subject.
Figure 8(a) Temporal activity graph for the subject. (b) Temporal activity graph for one of the users.
Figure 9Tweet cloud generated from the tweets of a cyber-bullying suspect.
Figure 10(a) The objects of the subject re-shared by her contacts. (b) The objects of other users re-shared by the subject.
Figure 11(a) Timeline of places visited by subjects. (b) Raw geographic data.
Figure 12Links from evidence to the objects.