To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 嗚??泣??喩????????馭??衣?? 1001101001101010001111110011111110001011100000110011111100111111100110100110011100111111001111110011111100111111001111110011111100111111001111111110100101100110001111110011111110001000110111110011111100111111 9a6a3f3f8b833f3f9a673f3f3f3f3f3f3f3fe9663f3f88df3f3f
EUC-JP 嗚??泣??喩?????洹??馭??衣?? 11010011110010110011111100111111101101011110001100111111001111111101001111001000001111110011111100111111001111110011111110001111110001111011101000111111001111111111000111000111001111110011111110110000111000010011111100111111 d3cb3f3fb5e33f3fd3c83f3f3f3f3f8fc7ba3f3ff1c73f3fb0e13f3f
UTF-8 嗚삠굦泣쒏껸喩볦쭍若뗫쮩洹앷콟馭귙꺃衣썹뜏 111001011001011110011010111011001000001010100000111010101011010110100110111001101011001110100011111011001001001010001111111010101011101110111000111001011001011010101001111010111011001110100110111011001010110110001101111011111010010110110100111010111001011110101011111011001010111010101001111001101011010010111001111011001001010110110111111011001011110110011111111010011010011010101101111010101011011110011001111010101011101010000011111010001010000110100011111011001000110110111001111010111001110010001111 e5979aec82a0eab5a6e6b3a3ec928feabbb8e596a9ebb3a6ecad8defa5b4eb97abecaea9e6b4b9ec95b7ecbd9fe9a6adeab799eaba83e8a1a3ec8db9eb9c8f
UHC 嗚삠굦泣쒏껸喩볦쭍若뗫쮩洹앷콟馭귙꺃衣썹뜏 111001111111000010111011111000111000001010001100111010111110100010011100111001101011001010111001111010101110011110010011111011001010011110000110111001011010111010001011111010111010100010000110111010101011011110011101111010101011000110010111111001011101111110000010111000111000001110101100111010111111110110111101111001111000110110010010 e7f0bbe3828cebe89ce6b2b9eae793eca786e5ae8beba886eab79deab197e5df82e383acebfdbde78d92

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)