To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????{?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011110110011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7b3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 陋帷鮒闢夂弗迴守鮒{陋帷鮒闢夂弗迴守鮒{^ 111010001001101110011011111001111001010110101001111010001001001110011010111001111001010110100100111001111000111110001110111001111001010110101001011110111110100010011011100110111110011110010101101010011110100010010011100110101110011110010101101001001110011110001111100011101110011110010101101010010111101101011110 e89b9be795a9e8939ae795a4e78f8ee795a97be89b9be795a9e8939ae795a4e78f8ee795a97b5e
EUC-JP 陋帷鮒闢夂弗迴守鮒{陋帷鮒闢夂弗迴守鮒{^ 111011111111101111010110111010011100101010101011111011111111001111010100111010011100101010100110111011011110111110111100111010011100101010101011011110111110111111111011110101101110100111001010101010111110111111110011110101001110100111001010101001101110110111101111101111001110100111001010101010110111101101011110 effbd6e9caabeff3d4e9caa6edefbce9caab7beffbd6e9caabeff3d4e9caa6edefbce9caab7b5e
UTF-8 陋帷鮒闢夂弗迴守鮒{陋帷鮒闢夂弗迴守鮒{^ 111010011001100110001011111001011011100010110111111010011010111010010010111010011001011110100010111001011010010010000010111001011011110010010111111010001011111110110100111001011010111010001000111010011010111010010010011110111110100110011001100010111110010110111000101101111110100110101110100100101110100110010111101000101110010110100100100000101110010110111100100101111110100010111111101101001110010110101110100010001110100110101110100100100111101101011110 e9998be5b8b7e9ae92e997a2e5a482e5bc97e8bfb4e5ae88e9ae927be9998be5b8b7e9ae92e997a2e5a482e5bc97e8bfb4e5ae88e9ae927b5e
UHC 陋??闢?弗?守?{陋??闢?弗?守?{^ 1101011110110000001111110011111111011100101000110011111111011101110101110011111111100001111110100011111101111011110101111011000000111111001111111101110010100011001111111101110111010111001111111110000111111010001111110111101101011110 d7b03f3fdca33fddd73fe1fa3f7bd7b03f3fdca33fddd73fe1fa3f7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)