What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????^ 00111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 嗚??洵??額ゃ?^ 1001101001101010001111110011111110011111101010110011111100111111100010100111101010000010111000010011111101011110 9a6a3f3f9fab3f3f8a7a82e13f5e
EUC-JP 嗚??洵??額ゃ?^ 1101001111001011001111110011111111011110101011010011111100111111101100111101101110100100111000110011111101011110 d3cb3f3fdead3f3fb3dba4e33f5e
UTF-8 嗚멨젶洵⑵뮯額ゃ돡^ 11100101100101111001101011101011101010011010100011101100101000001011011011100110101101001011010111100010100100011011010111101011101011101010111111101001101000011000110111100011100000101000001111101011100011111010000101011110 e5979aeba9a8eca0b6e6b4b5e291b5ebaeafe9a18de38283eb8fa15e
UHC 嗚멨젶洵⑵뮯額ゃ돡^ 11100111111100001011100011100101101000001010101011100010111001111010100111101000100100101011100011100100111111101010101011100011100010011010011001011110 e7f0b8e5a0aae2e7a9e892b8e4feaae389a65e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
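
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the converter's actual implementation. It assumes that the Python codecs `latin-1`, `cp932`, `euc_jp`, `utf-8`, and `cp949` correspond to the charsets listed above, and that characters a charset cannot represent are replaced with `?` (0x3f), as the table suggests. The names `CHARSETS`, `encode_row`, and `encode_table` are hypothetical.

```python
# Assumed mapping from the charset names in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset and return {charset: (bits, hex)}."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (bits, hex_string) in encode_table("ゃ^").items():
        print(name, bits, hex_string)
```

Because each byte is formatted as exactly eight bits, the binary string is always eight times the byte count, which makes it easy to compare how many bytes each charset needs for the same input.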