- Introduction
- Architecture
- Quick start
- Administrator's guide
- Sensor
- Server
- User's guide
- Reporting interface
- Real-life cases
- Mass scans
- Anonymous attackers
- Service attackers
- Malware
- Suspicious domain lookups
- Suspicious ipinfo requests
- Suspicious direct file downloads
- Suspicious HTTP requests
- Port scanning
- Potential UDP exfiltration
- False positives
- Requirements
- License
- Donate
Maltrail is a malicious traffic detection system, utilizing publicly available (black)lists containing malicious and/or generally suspicious trails, along with static trails compiled from various AV reports and custom user defined lists, where trail can be anything from domain name (e.g. zvpprsensinaix.com
for Banjori malware), URL (e.g. http://109.162.38.120/harsh02.exe
for known malicious executable) or IP address (e.g. 103.224.167.117
for known attacker). Also, it has (optional) advanced heuristic mechanisms that can help in discovery of unknown threats (e.g. new malware).
The following (black)lists (i.e. feeds) are being utilized:
alienvault, autoshun, badips, bambenekconsultingc2,
bambenekconsultingdga, binarydefense, bitcoinnodes, blocklist,
botscout, bruteforceblocker, ciarmy, cruzit, cybercrimetracker,
dshielddns, dshieldip, emergingthreatsbot, emergingthreatscip,
emergingthreatsdns, feodotrackerdns, feodotrackerip, greensnow,
malwarepatrol, malwareurlsnormal, maxmind, myip, nothink,
openbl, openphish, palevotracker, proxylists, proxyrss,
proxy, riproxies, rutgers, sblam, snort, socksproxy,
sslipbl, sslproxies, torproject, torstatus, voipbl, vxvault,
zeustrackerdns, zeustrackerip, zeustrackermonitor, zeustrackerurl,
etc.
As of static entries, the trails for the following malicious entities (e.g. malware C&Cs) have been manually included (from various AV reports):
alureon, android_stealer, angler, aridviper, axpergle,
babar, balamid, bamital, bankpatch, bedep, black_vine,
bubnix, carbanak, careto, casper, chewbacca, cleaver,
conficker, cosmicduke, couponarific, crilock, cryptolocker,
cryptowall, ctblocker, darkhotel, defru, desertfalcon,
destory, dorifel, dorkbot, dridex, dukes, dursg,
dyreza, emotet, equation, evilbunny, expiro, fakeran,
fareit, fbi_ransomware, fiexp, fignotok, fin4,
finfisher, gamarue, gauss, htran, jenxcus, kegotip,
kovter, lollipop, lotus_blossom, luckycat, mariposa,
miniduke, modpos, nbot, nettraveler, neurevt, nitol,
nonbolqu, nuqel, nwt, nymaim, palevo, pdfjsc, pift,
plugx, ponmocup, powelike, proslikefan, pushdo,
ransirac, redoctober, reveton, russian_doll, sality,
sathurbot, scieron, sefnit, shylock, siesta, simda,
sinkhole_1and1, sinkhole_abuse, sinkhole_blacklistthisdomain,
sinkhole_certpl, sinkhole_drweb, sinkhole_fbizeus,
sinkhole_fitsec, sinkhole_georgiatech, sinkhole_kaspersky,
sinkhole_microsoft, sinkhole_shadowserver, sinkhole_sinkdns,
sinkhole_zinkhole, skyper, smsfakesky, snake, snifula,
sofacy, stuxnet, teerac, teslacrypt, torpig,
torrentlocker, unruy, upatre, vawtrak, virut, vobfus,
volatile_cedar, vundo, waterbug, zeroaccess, zlob, etc.
Maltrail is based on the Sensor <-> Server <-> Client architecture. Sensor(s) is a standalone component running on the monitoring node (e.g. Linux platform connected passively to the SPAN/mirroring port or transparently inline on a Linux bridge) or at the standalone machine (e.g. Honeypot) where it "sniffs" the passing traffic for blacklisted items/trails (i.e. domain names, URLs and/or IPs). In case of a positive match, it sends the event details to the (central) Server where they are being stored inside the appropriate logging directory (i.e. LOG_DIR
described in the Configuration section). If Sensor is being run on the same machine as Server (default configuration), logs are stored directly into the local logging directory. Otherwise, they are being sent via UDP messages to the remote server (i.e. LOG_SERVER
described in the Configuration section).
Server's primary role is to store the event details and provide back-end support for the reporting web application. In default configuration, server and sensor will run on the same machine. So, to prevent potential disruptions in sensor activities, the front-end reporting part is based on the "Fat client" architecture (i.e. all data post-processing is being done inside the client's web browser instance). Events (i.e. log entries) for the chosen (24h) period are transferred to the Client, where the reporting web application is solely responsible for the presentation part. Data is sent toward the client in compressed chunks, where they are processed sequentially. The final report is created in a highly condensed form, practically allowing presentation of virtually unlimited number of events.
Note: Server component can be skipped altogether, and just use the standalone Sensor. In such case, all events would be stored in the local logging directory, while the log entries could be examined either manually or by some CSV reading application.
The following set of commands should get your Maltrail Sensor up and running (out of the box with default settings and monitoring interface "any"):
sudo apt-get install python-pcapy
git clone https://github.com/stamparm/maltrail.git
cd maltrail
sudo python sensor.py
To start the (optional) Server on same machine, open a new terminal and execute the following:
[[ -d maltrail ]] || git clone https://github.com/stamparm/maltrail.git
cd maltrail
python server.py
To test that everything is up and running execute the following:
ping -c 1 136.161.101.53
cat /var/log/maltrail/$(date +"%Y-%m-%d").log
Access the reporting interface (i.e. Client) by visiting the http://127.0.0.1:8338 (default credentials: admin:changeme!
) from your web browser:
Sensor's configuration can be found inside the maltrail.conf
file's section [Sensor]
:
If option USE_MULTIPROCESSING
is set to true
then all CPU cores will be used. One core will be used only for packet capture (with appropriate affinity, IO priority and nice level settings), while other cores will be used for packet processing. Otherwise, everything will be run on a single core. Option USE_FEED_UPDATES
can be used to turn off the trail updates from feeds altogether (and just use the provided static ones). Option UPDATE_PERIOD
contains the number of seconds between each automatic trails update (Note: default value is set to 86400
(i.e. one day)) by using definitions inside the trails
directory (Note: both Sensor and Server take care of the trails update). Option CUSTOM_TRAILS_DIR
can be used by user to provide location of directory containing the custom trails (*.txt
) files.
Option USE_HEURISTICS
turns on heuristic mechanisms (e.g. long domain name (suspicious)
, excessive no such domain name (suspicious)
, direct .exe download (suspicious)
, etc.), potentially introducing false positives. Option CAPTURE_BUFFER
presents a total memory (in bytes of percentage of total physical memory) to be used in case of multiprocessing mode for storing packet capture in a ring buffer for further processing by non-capturing processes. Option MONITOR_INTERFACE
should contain the name of the capturing interface. Use value any
to capture from all interfaces (if OS supports this). Option CAPTURE_FILTER
should contain the network capture (tcpdump
) filter to skip the uninteresting packets and ease the capturing process. Option SENSOR_NAME
contains the name that should be appearing inside the events sensor_name
value, so the event from one sensor could be distinguished from the other. If option LOG_SERVER
is set, then all events are being sent remotely to the Server, otherwise they are stored directly into the logging directory set with option LOG_DIR
inside the [Server]
section. In case that the option UPDATE_SERVER
is set, then all the trails are being pulled from the given location, otherwise they are being updated from trails definitions located inside the installation itself.
When running the sensor (e.g. sudo python sensor.py
) for the first time and/or after a longer period of non-running, it will automatically update the trails from trail definitions (Note: stored inside the trails
directory). After the initialization, it will start monitoring the configured interface (option MONITOR_INTERFACE
inside the maltrail.conf
) and write the events to either the configured log directory (option LOG_DIR
) or send them remotely to the logging/reporting Server (option LOG_SERVER
).
Detected events are stored inside the Server's logging directory (i.e. option LOG_DIR
) in easy-to-read CSV format (Note: whitespace ' ' is used as a delimiter) as single line entries consisting of: time
sensor
src_ip
src_port
dst_ip
dst_port
proto
trail_type
trail
trail_info
reference
(e.g. "2015-10-19 15:48:41.152513" beast 192.168.5.33 32985 8.8.8.8 53 UDP DNS 0000mps.webpreview.dsl.net malicious siteinspector.comodo.com
):
Server's configuration can be found inside the maltrail.conf
section [Server]
:
Option HTTP_ADDRESS
contains the web server's listening address (Note: use 0.0.0.0
to listen on all interfaces). Option HTTP_PORT
contains the web server's listening port. Default listening port is set to 8338
. If option USE_SSL
is set to true
then SSL/TLS
will be used for accessing the web server (e.g. https://192.168.6.10:8338/
). In that case, option SSL_PEM
should be pointing to the server's private/cert PEM file.
Subsection USERS
contains user's configuration settings. Each user entry consists of the username:sha256(password):UID:filter_netmask(s)
. Value UID
represents the unique user identifier, where it is recommended to use values lower than 1000 for administrative accounts, while higher value for non-administrative accounts. The part filter_netmask(s)
represents the comma-delimited hard filter(s) that can be used to filter the shown events depending on the user account(s). Default entry is as follows:
Option UDP_ADDRESS
contains the server's log collecting listening address (Note: use 0.0.0.0
to listen on all interfaces), while option UDP_PORT
contains listening port value. If turned on, when used in combination with option LOG_SERVER
, it can be used for distinct (multiple) Sensor <-> Server architecture.
Same as for Sensor, when running the Server (e.g. python server.py
) for the first time and/or after a longer period of non-running, if option USE_SERVER_UPDATE_TRAILS
is set to true
, it will automatically update the trails from trail definitions (Note: stored inside the trails
directory). Its basic function is to store the log entries inside the logging directory (i.e. option LOG_DIR
) and provide the web reporting interface for presenting those same entries to the end-user (Note: there is no need install the 3rd party web server packages like Apache):
When entering the Server's reporting interface (i.e. via the address defined by options HTTP_ADDRESS
and HTTP_PORT
), user will be presented with the following authentication dialog. User has to enter the proper credentials that have been set by the server's administrator inside the configuration file maltrail.conf
(Note: default credentials are admin:changeme!
):
Once inside, user will be presented with the following reporting interface:
The top part holds a sliding timeline (Note: activated after clicking the current date label and/or the calendar icon ) where user can select logs for past events (Note: mouse over event will trigger display of tooltip with approximate number of events for current date). Dates are grouped by months, where 4 month period of data are displayed inside the widget itself. However, by using the provided slider (i.e. ) user can easily access events from previous months.
Once clicking the date, all events for that particular date should be loaded and represented by the client's web browser. Depending on number of events and the network connection speed, loading and display of logged events could take from couple of seconds, up to several minutes (e.g. 100,000 events takes around 5 seconds in total). For the whole processing time, animated loader will be displayed across the disabled user interface:
Middle part holds a summary of displayed events. Events
box represents total number of events in a selected 24-hour period, where red line represents IP-based events, blue line represents DNS-based events and yellow line represents URL-based events. Sources
box represents number of events per top sources in form of a stacked column chart, with total number of sources on top. Threats
box represents percentage of top threats in form of a pie chart (Note: gray area holds all threats having <1% in total events), with total number of threats on top. Trails
box represents percentage of top trails in form of a pie chart (Note: gray area holds all trails having <1% in total events), with total number of trails on top.
Each of those boxes are active, hence the click on one of those will result with a more detailed graph:
Bottom part holds a condensed representation of logged events in form of a paginated table. Each entry holds details for a single threat (Note: uniquely identified by a pair src_ip~trail
or dst_ip~trail
if the src_ip
is the same as the trail
- as in case of attacks coming from the outside):
Column threat
holds threat's unique ID (e.g. 85fdb08d
) and color (Note: extruded from the threat's ID), sensor
holds sensor name(s) where the event has been triggered (e.g. blitvenica
), events
holds total number of events for a current threat, first_seen
holds time of first event in a selected (24h) period (e.g. 06th 08:21:54
), last_seen
holds time of last event in a selected (24h) period (e.g. 06th 15:21:23
), src_ip
holds source IP(s) of a threat (e.g. 99.102.41.102
), src_port
holds source port(s) (e.g. 44556, 44589, 44601
), dst_ip
holds destination IP(s) (e.g. 213.202.100.28
), dst_port
holds destination port(s) (e.g. 80 (HTTP)
), proto
holds protocol(s), (e.g. TCP
), trail
holds a blacklist entry that triggered the event(s), info
holds more information about the threat/trail (e.g. attacker
for known attacker's IP addresses or ipinfo
for known IP information service commonly used by malware during a startup), reference
holds a source of the blacklisted entry (e.g. (static)
for static trails or myip.ms
for a dynamic feed retrieved from that same source) and tags
holds user defined tags for a given trail (e.g. APT28
).
When moving mouse over src_ip
and dst_ip
table entries, information tooltip is being displayed with detailed WHOIS information (Note: RIPE is the information provider):
Event details (e.g. src_port
, dst_port
, proto
, etc.) that differ inside same threat entry are condensed in form of a cloud icon (i.e. ). This is performed to get an usable reporting interface with as less rows as possible. Moving mouse over such icon will result in a display of an information tooltip with all items held (e.g. all port numbers being scanned by attacker
):
Clicking on one such icon will open a new dialog containing all stored items (Note: in their uncondensed form) ready to be Copy-Paste(d) for further analysis:
When hovering mouse pointer over the threat's trail for couple of seconds it will result in a frame consisted of results using the trail as a search term performed against DuckDuckGo search engine. In lots of cases, this provides basic information about the threat itself, eliminating the need for user to do the manual search for it. In upper right corner of the opened frame window there are two extra buttons. By clicking the first one (i.e. ), the resulting frame will be opened inside the new browser's tab (or window), while by clicking the second one (i.e. ) will immediately close the frame (Note: the same action is achieved by moving the mouse pointer outside the frame borders):
For each threat there is a column tag
that can be filled with arbitrary "tags" to closely describe all threats sharing the same trail. Also, it is a great way to describe threats individually, so all threats sharing the same tag (e.g. yahoo
) could be grouped out later:
In the following section some of the "usual suspects" scenarios will be described through the real-life cases.
Mass scans is a fairly common phenomenon where individuals and/or organizations give themselves a right to scan the whole 0.0.0.0/0 IP range (i.e. whole Internet) on a daily basis, with disclaimer where they say that if you don't like it then you should contact them privately to be skipped from future scans.
To make stuff worse, organizations as Shodan and ZoomEye give all results freely available (to other potential attackers) through their search engine. In the following screenshots you'll see details of Shodan scans in one single day.
Here is a reverse DNS lookup of the "attacker"'s address:
When hovering mouse pointer over the trail
column's content (IP address), you'll be presented with the search results from DuckDuckGo where you'll be able to find more information about the "attacker" (i.e. Shodan):
In the dst_ip
column, if you have a large organization, you'll be presented with large list of scanned IP addresses:
In the dst_port
column you'll be able to see all ports that have been scanned by such mass scans:
In other similar situations you'll see the same behaviour, coming from blacklisted individual attacker(s) (in this case by cinsscore.com):
One more common behaviour is scanning of the whole 0.0.0.0/0 IP range (i.e. Internet) in search for one particular port (e.g. TCP port 443 when Heartbleed has been found). In the following screenshot you'll find one such case for previously blacklisted attacker(s) (in this case by autoshun.org) targeting the UDP port 5060 (i.e. SIP) in search for misconfigured VoIP devices:
To spot the potential attackers hidden behind the Tor anonymity network, Maltrail utilizes publicly available list of Tor exit nodes. In the following screenshot you'll see a case where unknown attacker has been utilizing the Tor network to access the web target (over HTTPS) in our organization's range in rather suspicious way (total 599 connection requests in less than 10 minutes):
Fairly similar case to the previous one is when previously blacklisted attacker (in this case by autoshun.org) tries to access particular service in our organization's range in rather suspicious way (total 580 connection requests in 15 minutes):
If we enter the SSH attacker
to the Filter
field, we'll be able to see all similar occurrences for that day, but in this case for port 22 (i.e. SSH):
In the following case previously blacklisted attacker (in this case by autoshun.org) has scanned the whole organization's range in search for potentially vulnerable HTTPS service(s) (ports 443 and 8443):
In case of connection attempts coming from infected computers inside our organization toward already known C&C servers, you'll be able to find threats similar to the following (in this case CTBLocker ransomware):
In the following case file downloads from blacklisted (in this case by malwarepatrol.net) URL(s) have occurred:
If we enter the particular malware name (in this case Simda) into the Filter
field, only threats that are known to be linked to this malware will be filtered in (showing you all affected internal computers):
More generally, if we enter the malware
into the Filter
field, all threats that are known to be triggered by known malware trails (e.g. IP
addresses) will be filtered in:
Maltrail uses the static list of TLD domains that are known to be commonly involved in suspicious activities. Most such TLD domains are coming from free domain registrars (e.g. Freenom), hence they should be under greater scrutiny. In the following screenshot we can find a case where one such TLD domain .cm
has been used by unknown malware using the DGA algorithm to contact its C&C server(s):
There are also cases when perfectly valid TLD domains (e.g. .ru
) are used for suspicious activities, such in this case (e.g. long domain name (suspicious)
) where the domains are obviously DGA generated by unknown malware:
Maltrail uses static list of so-called "dynamic domains" that are often used in suspicious activities (e.g. for malware C&C servers that often change the destination's IP addresses):
Also, Maltrail uses static list of "onion"-related domains that are also often used in suspicious activities (e.g. malware contacting C&C servers by using Tor2Web service(s)):
In case of old and/or obsolete malware that sits undetected on organization's infected internal computers, there is often a "phenomenon" where malware continuously tries to contact the long dead C&C server's domain without any DNS resolution. Hence, those kind of (potential) threats will be marked as excessive no such domain name (suspicious)
:
In case that one trail is responsible for too many threats (e.g. in case of fake source IPs like in DNS amplification attacks), all similar threats will be grouped under a single flood
threat (Note: threat's ID will be marked with F0
), like in the following example:
Lots of malware uses some kind of ipinfo
service (e.g. ipinfo.io) to find out the victim's Internet IP address. In case of regular and especially in out-of-office hours, those kind of requests should be closely monitored, like in the following example:
By using filter ipinfo
all potentially infected computers in our organization's range can be listed that share this kind of suspicious behaviour:
Maltrail tracks all suspicious direct file download attempts (e.g. .apk
, .exe
and .scr
file extensions). This can trigger lots of false positives, but eventually could help in reconstruction of the chain of infection (Note: legitimate service providers, like Google, usually use encrypted HTTPS to perform this kind of downloads):
For testing purposes, web application "reconnaissance" tool skipfish has been used. As it tries to download blindly numerous .exe
files, the following related threat has been identified:
In case of suspicious requests coming from outer web application security scanners (e.g. searching for SQLi, XSS, LFI, etc. vulnerabilities) and/or the internal user malicious attempts toward unknown web sites, threats like the following could be found (real case of attackers trying to exploit Joomla! CMS CVE-2015-7297, CVE-2015-7857, and CVE-2015-7858 vulnerabilities):
In following example, legal (i.e. "in-house") web application vulnerability scanning run has been marked as "suspicious":
If we click on the cloud icon (i.e. ) for details and copy paste the whole content to a textual file, we'll be able to see all suspicious HTTP requests:
In the following screenshot, a run of popular SQLi vulnerability tool sqlmap can be found inside our logs:
In case of too many connection attempts toward considerable amount of different TCP ports, Maltrail will warn about the potential port scanning, as a result of its heuristic mechanism detection. It the following screenshot such warning(s) can be found for a run of popular port scanning tool nmap:
In the following example, it can be seen an overly suspicious behaviour, initiated by known attacker toward our organization's IP, utilizing large amount of traffic over unknown UDP service port(s):
Like in all other security solutions, Maltrail is prone to so-called "false positives". In those kind of cases, Maltrail will (especially in case of suspicious
threats) record a regular user's behaviour and mark it as malicious and/or suspicious. In the following example it can be seen that one of feed providers blocklist.de
marked regular Google servers as attacker
(s), resulting with the following threats:
In the following example, access to the perfectly valid .club
domains resulted with the following threat:
Nevertheless, administrator(s) should invest some extra time and check by himself whether the "suspicious" means malicious or not:
To properly run the Maltrail, Python 2.6.x or 2.7.x is required, together with pcapy (e.g. sudo apt-get install python-pcapy
). There are no other requirements, other than to run the Sensor component with the administrative/root privileges.
This software is provided under under a MIT License. See the accompanying LICENSE file for more information.
This project is a result of numerous hours of software development. If you appreciate the work and you want to see it further developed, please consider making a donation via PayPal to miroslav.stampar@gmail.com
or via Ƀitcoins to 1JCtgmpC1eWvdHXrKfvMAunfvcaaMXLP5G
. Thank you very much if you decide to do it.