To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 嗚??泣??揄??艶l??⑥?榮??猷 100110100110101000111111001111111000101110000011001111110011111110011101100010010011111100111111100010011001000010000010100011000011111100111111100001110100010100111111100111101100010000111111001111111001011101010001 9a6a3f3f8b833f3f9d893f3f8990828c3f3f87453f9ec43f3f9751
EUC-JP 嗚??泣??揄??艶l?洹??榮??猷 11010011110010110011111100111111101101011110001100111111001111111101100111101001001111110011111110110001111100001010001111101100001111111000111111000111101110100011111100111111110111001100011000111111001111111100110110110010 d3cb3f3fb5e33f3fd9e93f3fb1f0a3ec3f8fc7ba3f3fdcc63f3fcdb2
UTF-8 嗚삠굦泣쒍뭄揄앸짎艶l뫆洹⑥춾榮붽퍓猷 111001011001011110011010111011001000001010100000111010101011010110100110111001101011001110100011111011001001001010001101111010111010110110000100111001101000111110000100111011001001010110111000111011001010011110001110111010001000100110110110111011111011110110001100111010111010101110000110111001101011010010111001111000101001000110100101111011001011011010111110111001101010011010101110111010111011011010111101111011011000110110010011111001111000110010110111 e5979aec82a0eab5a6e6b3a3ec928debad84e68f84ec95b8eca78ee889b6efbd8cebab86e6b4b9e291a5ecb6bee6a6aeebb6bded8d93e78cb7
UHC 嗚삠굦泣쒍뭄揄앸짎艶l뫆洹⑥춾榮붽퍓猷 1110011111110000101110111110001110000010100011001110101111101000100111001110010010111001101100111110101011110001100111011110101110100011100110101110011011111101101000111110110010010001101010011110101010110111101010001110110010101101100110101110011110110100100101001110101010111011100010101110101110100011 e7f0bbe3828cebe89ce4b9b3eaf19deba39ae6fda3ec91a9eab7a8ecad9ae7b494eabb8aeba3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)