To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ?訥こ??莎ぱ?晙?訥こ??莎ぱ??^ 00111111111001100110001110000010101100010011111100111111111001001011001110000010110011110011111111111010110101110011111111100110011000111000001010110001001111110011111111100100101100111000001011001111001111110011111101011110 3fe66382b13f3fe4b382cf3ffad73fe66382b13f3fe4b382cf3f3f5e
EUC-JP ?訥こ??莎ぱ?晙?訥こ??莎ぱ?寯^ 00111111111010111100010010100100101100110011111100111111111010001011010110100100110100010011111110001111110000101011101000111111111010111100010010100100101100110011111100111111111010001011010110100100110100010011111110001111101110101110010101011110 3febc4a4b33f3fe8b5a4d13f8fc2ba3febc4a4b33f3fe8b5a4d13f8fbae55e
UTF-8 룶訥こ룶깹莎ぱ룶晙룶訥こ룶깹莎ぱ룶寯^ 11101011101000111011011011101000101010001010010111100011100000011001001111101011101000111011011011101010101110011011100111101000100011101000111011100011100000011011000111101011101000111011011011100110100110011001100111101011101000111011011011101000101010001010010111100011100000011001001111101011101000111011011011101010101110011011100111101000100011101000111011100011100000011011000111101011101000111011011011100101101011111010111101011110 eba3b6e8a8a5e38193eba3b6eab9b9e88e8ee381b1eba3b6e69999eba3b6e8a8a5e38193eba3b6eab9b9e88e8ee381b1eba3b6e5afaf5e
UHC 룶訥こ룶깹莎ぱ룶晙룶訥こ룶깹莎ぱ룶寯^ 10001111101010111101001011101101101010101011001110001111101010111011001010100001110111101110110110101010110100011000111110101011111100011101101110001111101010111101001011101101101010101011001110001111101010111011001010100001110111101110110110101010110100011000111110101011111100011101100101011110 8fabd2edaab38fabb2a1deedaad18fabf1db8fabd2edaab38fabb2a1deedaad18fabf1d95e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)