{"id":598,"date":"2022-06-13T14:48:19","date_gmt":"2022-06-13T21:48:19","guid":{"rendered":"https:\/\/jasonsblog.access.ly\/?p=598"},"modified":"2022-11-05T07:38:05","modified_gmt":"2022-11-05T14:38:05","slug":"install-the-yacy-search-engine-with-docker","status":"publish","type":"post","link":"https:\/\/jasonsblog.ddns.net\/index.php\/2022\/06\/13\/install-the-yacy-search-engine-with-docker\/","title":{"rendered":"Install the Yacy Search Engine with Docker"},"content":{"rendered":"\n<p><a href=\"https:\/\/yacy.net\/\" target=\"_blank\" aria-label=\"Yacy (opens in a new tab)\" rel=\"noreferrer noopener\" class=\"ek-link\">Yacy<\/a> is a promising distributed search engine project. You can run Yacy as a P2P, peer to peer, search engine, or as your very own search engine with web crawler, or as an intranet, local search engine. However, this is not ready to be your primary search engine as there is something to the major search engines and their algorithms. But this is an interesting project to play around with and check back on over time. Maybe one day this will be good enough to replace the major search engines and your privacy search proxies. But since you can create your own custom web crawls, it can be an interesting tool for specific research projects now.<\/p>\n\n\n\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p>Decentralization<\/p>\n\n\n\n<p>\nImagine if, rather than relying on the proprietary software of a large professional search engine operator, your search engine was run by many private computers which aren&#8217;t under the control of any one company or individual. <\/p>\n\n\n\n<p>Well, that&#8217;s what YaCy does!\n<\/p>\n<\/blockquote>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"720\" src=\"https:\/\/jasonsblog.ddns.net\/wp-content\/uploads\/2022\/11\/NetworkPicture-1.png\" alt=\"\" class=\"wp-image-1818\" srcset=\"https:\/\/jasonsblog.ddns.net\/wp-content\/uploads\/2022\/11\/NetworkPicture-1.png 1024w, https:\/\/jasonsblog.ddns.net\/wp-content\/uploads\/2022\/11\/NetworkPicture-1-300x211.png 300w, https:\/\/jasonsblog.ddns.net\/wp-content\/uploads\/2022\/11\/NetworkPicture-1-768x540.png 768w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><figcaption class=\"wp-element-caption\">Yacy Search Peers<\/figcaption><\/figure>\n\n\n\n<p>To install your own instance of Yacy in docker:<\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>docker run -d --name yacy_search_server -p 8090:8090 -p 8443:8443 -v yacy_search_server_data:\/opt\/yacy_search_server\/DATA --restart unless-stopped --log-opt max-size=200m --log-opt max-file=2 yacy\/yacy_search_server:latest<\/code><\/pre>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"784\" height=\"420\" src=\"https:\/\/jasonsblog.ddns.net\/wp-content\/uploads\/2022\/11\/Screenshot_20220614_140423.png\" alt=\"\" class=\"wp-image-1794\" srcset=\"https:\/\/jasonsblog.ddns.net\/wp-content\/uploads\/2022\/11\/Screenshot_20220614_140423.png 784w, https:\/\/jasonsblog.ddns.net\/wp-content\/uploads\/2022\/11\/Screenshot_20220614_140423-300x161.png 300w, https:\/\/jasonsblog.ddns.net\/wp-content\/uploads\/2022\/11\/Screenshot_20220614_140423-768x411.png 768w\" sizes=\"auto, (max-width: 784px) 100vw, 784px\" \/><\/figure>\n\n\n\n<p>Once installed you can access by ip:8090.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"839\" src=\"https:\/\/jasonsblog.ddns.net\/wp-content\/uploads\/2022\/11\/Screenshot_20220613_134039-1024x839.png\" alt=\"\" class=\"wp-image-1801\" srcset=\"https:\/\/jasonsblog.ddns.net\/wp-content\/uploads\/2022\/11\/Screenshot_20220613_134039-1024x839.png 1024w, https:\/\/jasonsblog.ddns.net\/wp-content\/uploads\/2022\/11\/Screenshot_20220613_134039-300x246.png 300w, https:\/\/jasonsblog.ddns.net\/wp-content\/uploads\/2022\/11\/Screenshot_20220613_134039-768x629.png 768w, https:\/\/jasonsblog.ddns.net\/wp-content\/uploads\/2022\/11\/Screenshot_20220613_134039.png 1375w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<p>The default login for the docker module is admin:yacy (change password after install). When you login you can go to the administration console.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"839\" src=\"https:\/\/jasonsblog.ddns.net\/wp-content\/uploads\/2022\/11\/Screenshot_20220613_134138-1024x839.png\" alt=\"\" class=\"wp-image-1800\" srcset=\"https:\/\/jasonsblog.ddns.net\/wp-content\/uploads\/2022\/11\/Screenshot_20220613_134138-1024x839.png 1024w, https:\/\/jasonsblog.ddns.net\/wp-content\/uploads\/2022\/11\/Screenshot_20220613_134138-300x246.png 300w, https:\/\/jasonsblog.ddns.net\/wp-content\/uploads\/2022\/11\/Screenshot_20220613_134138-768x629.png 768w, https:\/\/jasonsblog.ddns.net\/wp-content\/uploads\/2022\/11\/Screenshot_20220613_134138.png 1375w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<p>And this will show that the results aren&#8217;t necessarily as good as the main search engines or privacy proxies.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"839\" src=\"https:\/\/jasonsblog.ddns.net\/wp-content\/uploads\/2022\/11\/Screenshot_20220613_134918-1024x839.png\" alt=\"\" class=\"wp-image-1799\" srcset=\"https:\/\/jasonsblog.ddns.net\/wp-content\/uploads\/2022\/11\/Screenshot_20220613_134918-1024x839.png 1024w, https:\/\/jasonsblog.ddns.net\/wp-content\/uploads\/2022\/11\/Screenshot_20220613_134918-300x246.png 300w, https:\/\/jasonsblog.ddns.net\/wp-content\/uploads\/2022\/11\/Screenshot_20220613_134918-768x629.png 768w, https:\/\/jasonsblog.ddns.net\/wp-content\/uploads\/2022\/11\/Screenshot_20220613_134918.png 1375w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<p>If you want to improve the results, you can start your own web crawl. <\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"839\" src=\"https:\/\/jasonsblog.ddns.net\/wp-content\/uploads\/2022\/11\/Screenshot_20220613_145646-1024x839.png\" alt=\"\" class=\"wp-image-1798\" srcset=\"https:\/\/jasonsblog.ddns.net\/wp-content\/uploads\/2022\/11\/Screenshot_20220613_145646-1024x839.png 1024w, https:\/\/jasonsblog.ddns.net\/wp-content\/uploads\/2022\/11\/Screenshot_20220613_145646-300x246.png 300w, https:\/\/jasonsblog.ddns.net\/wp-content\/uploads\/2022\/11\/Screenshot_20220613_145646-768x629.png 768w, https:\/\/jasonsblog.ddns.net\/wp-content\/uploads\/2022\/11\/Screenshot_20220613_145646.png 1375w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<p>I started a web crawl of docker.com and you can track the result and there is even a graphical display of its progress.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"839\" src=\"https:\/\/jasonsblog.ddns.net\/wp-content\/uploads\/2022\/11\/Screenshot_20220613_150049-1024x839.png\" alt=\"\" class=\"wp-image-1797\" srcset=\"https:\/\/jasonsblog.ddns.net\/wp-content\/uploads\/2022\/11\/Screenshot_20220613_150049-1024x839.png 1024w, https:\/\/jasonsblog.ddns.net\/wp-content\/uploads\/2022\/11\/Screenshot_20220613_150049-300x246.png 300w, https:\/\/jasonsblog.ddns.net\/wp-content\/uploads\/2022\/11\/Screenshot_20220613_150049-768x629.png 768w, https:\/\/jasonsblog.ddns.net\/wp-content\/uploads\/2022\/11\/Screenshot_20220613_150049.png 1375w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<p>A <a aria-label=\"scathing review of Yacy from 2021 (opens in a new tab)\" href=\"https:\/\/linuxreviews.org\/The_YaCy_Search_Server_Is_Sort-Of_Being_Actively_Developed_Again_After_Half_A_Decade_Of_Inactivity\" target=\"_blank\" rel=\"noreferrer noopener\" class=\"ek-link\">scathing review of Yacy from 2021<\/a> showed that development was just starting back up, and that the project was in desperate need of Java programmers. Looking at the <a href=\"https:\/\/github.com\/yacy\/yacy_search_server\" target=\"_blank\" aria-label=\"Yacy Github page (opens in a new tab)\" rel=\"noreferrer noopener\" class=\"ek-link\">Yacy Github page<\/a> there seems to only be a couple people actively contributing to the project. <\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"778\" src=\"https:\/\/jasonsblog.ddns.net\/wp-content\/uploads\/2022\/11\/Screenshot_20220614_095855-1024x778.png\" alt=\"\" class=\"wp-image-1795\" srcset=\"https:\/\/jasonsblog.ddns.net\/wp-content\/uploads\/2022\/11\/Screenshot_20220614_095855-1024x778.png 1024w, https:\/\/jasonsblog.ddns.net\/wp-content\/uploads\/2022\/11\/Screenshot_20220614_095855-300x228.png 300w, https:\/\/jasonsblog.ddns.net\/wp-content\/uploads\/2022\/11\/Screenshot_20220614_095855-768x583.png 768w, https:\/\/jasonsblog.ddns.net\/wp-content\/uploads\/2022\/11\/Screenshot_20220614_095855.png 1326w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<p>We can only hope that this project captures the attention of some talented Java programmers and develops further. Consequently, distributed systems are the way of the future just like the cryptocurrency Bitcoin, with distributed mining around the world so no one government can shut it down or control it. So perhaps one day as Yacy&#8217;s development continues, it can be a replacement for the major search engines that have already shown their propensity for censoring content and using their platforms to manipulate people regarding elections and other societal changes. I&#8217;m reminded that Google had to remove their code of conduct statement &#8220;<a rel=\"noreferrer noopener\" href=\"https:\/\/en.wikipedia.org\/wiki\/Don%27t_be_evil\" target=\"_blank\">Don&#8217;t be evil<\/a>&#8221; as they changed it to &#8220;Do the Right Thing&#8221;. Whose right thing? Because it&#8217;s certainly not THE right thing when I look through results on certain topics. <\/p>\n","protected":false},"excerpt":{"rendered":"<p>Yacy is a promising distributed search engine project. You can run Yacy as a P2P, peer to peer, search engine, or as your very own search engine with web crawler, or as an intranet, local search engine. However, this is not ready to be your primary search engine as there is something to the major [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[6],"tags":[],"class_list":["post-598","post","type-post","status-publish","format-standard","hentry","category-tech"],"blocksy_meta":{"styles_descriptor":{"styles":{"desktop":"","tablet":"","mobile":""},"google_fonts":[],"version":6}},"featured_image_src":null,"author_info":{"display_name":"Jason","author_link":"https:\/\/jasonsblog.ddns.net\/index.php\/author\/jturning\/"},"_links":{"self":[{"href":"https:\/\/jasonsblog.ddns.net\/index.php\/wp-json\/wp\/v2\/posts\/598","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/jasonsblog.ddns.net\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/jasonsblog.ddns.net\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/jasonsblog.ddns.net\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/jasonsblog.ddns.net\/index.php\/wp-json\/wp\/v2\/comments?post=598"}],"version-history":[{"count":12,"href":"https:\/\/jasonsblog.ddns.net\/index.php\/wp-json\/wp\/v2\/posts\/598\/revisions"}],"predecessor-version":[{"id":1892,"href":"https:\/\/jasonsblog.ddns.net\/index.php\/wp-json\/wp\/v2\/posts\/598\/revisions\/1892"}],"wp:attachment":[{"href":"https:\/\/jasonsblog.ddns.net\/index.php\/wp-json\/wp\/v2\/media?parent=598"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/jasonsblog.ddns.net\/index.php\/wp-json\/wp\/v2\/categories?post=598"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/jasonsblog.ddns.net\/index.php\/wp-json\/wp\/v2\/tags?post=598"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}