To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????z}?????????z{^ 0011111100111111001111110011111100111111001111110011111100111111001111110111101001111101001111110011111100111111001111110011111100111111001111110011111100111111011110100111101101011110 3f3f3f3f3f3f3f3f3f7a7d3f3f3f3f3f3f3f3f3f7a7b5e
SJIS-WIN 陋帷鮒闢夂弗迴守鮒z}陋帷鮒闢夂弗迴守鮒z{^ 1110100010011011100110111110011110010101101010011110100010010011100110101110011110010101101001001110011110001111100011101110011110010101101010010111101001111101111010001001101110011011111001111001010110101001111010001001001110011010111001111001010110100100111001111000111110001110111001111001010110101001011110100111101101011110 e89b9be795a9e8939ae795a4e78f8ee795a97a7de89b9be795a9e8939ae795a4e78f8ee795a97a7b5e
EUC-JP 陋帷鮒闢夂弗迴守鮒z}陋帷鮒闢夂弗迴守鮒z{^ 1110111111111011110101101110100111001010101010111110111111110011110101001110100111001010101001101110110111101111101111001110100111001010101010110111101001111101111011111111101111010110111010011100101010101011111011111111001111010100111010011100101010100110111011011110111110111100111010011100101010101011011110100111101101011110 effbd6e9caabeff3d4e9caa6edefbce9caab7a7deffbd6e9caabeff3d4e9caa6edefbce9caab7a7b5e
UTF-8 陋帷鮒闢夂弗迴守鮒z}陋帷鮒闢夂弗迴守鮒z{^ 1110100110011001100010111110010110111000101101111110100110101110100100101110100110010111101000101110010110100100100000101110010110111100100101111110100010111111101101001110010110101110100010001110100110101110100100100111101001111101111010011001100110001011111001011011100010110111111010011010111010010010111010011001011110100010111001011010010010000010111001011011110010010111111010001011111110110100111001011010111010001000111010011010111010010010011110100111101101011110 e9998be5b8b7e9ae92e997a2e5a482e5bc97e8bfb4e5ae88e9ae927a7de9998be5b8b7e9ae92e997a2e5a482e5bc97e8bfb4e5ae88e9ae927a7b5e
UHC 陋??闢?弗?守?z}陋??闢?弗?守?z{^ 11010111101100000011111100111111110111001010001100111111110111011101011100111111111000011111101000111111011110100111110111010111101100000011111100111111110111001010001100111111110111011101011100111111111000011111101000111111011110100111101101011110 d7b03f3fdca33fddd73fe1fa3f7a7dd7b03f3fdca33fddd73fe1fa3f7a7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)