To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 嗚?????蟻??語⑤?誼℡?怨??暗??? 100110100110101000111111001111110011111100111111001111111000101101100001001111110011111110001100111010101000011101000100001111111000101101100010100001111000010000111111100010011000010100111111001111111000100011000011001111110011111100111111 9a6a3f3f3f3f3f8b613f3f8cea87443f8b6287843f89853f3f88c33f3f3f
EUC-JP 嗚?????蟻??語??誼??怨??暗??? 11010011110010110011111100111111001111110011111100111111101101011100001000111111001111111011100011101100001111110011111110110101110000110011111100111111101100011110010100111111001111111011000011000101001111110011111100111111 d3cb3f3f3f3f3fb5c23f3fb8ec3f3fb5c33f3fb1e53f3fb0c53f3f3f
UTF-8 嗚삠굥栒뤸꼷蟻욎춷語⑤벡誼℡슫怨몃듌暗싎띿춷 111001011001011110011010111011001000001010100000111010101011010110100101111001101010000010010010111010111010010010111000111010101011110010110111111010001001111110111011111011001001101010001110111011001011011010110111111010001010101010011110111000101001000110100100111010111011001010100001111010001010101010111100111000101000010010100001111011001000101010101011111001101000000010101000111010111010101010000011111010111001001110001100111001101001101010010111111011001000101110001110111010111001110110111111111011001011011010110111 e5979aec82a0eab5a5e6a092eba4b8eabcb7e89fbbec9a8eecb6b7e8aa9ee291a4ebb2a1e8aabce284a1ec8aabe680a8ebaa83eb938ce69a97ec8b8eeb9dbfecb6b7
UHC 嗚삠굥栒뤸꼷蟻욎춷語⑤벡誼℡슫怨몃듌暗싎띿춷 1110011111110000101110111110001110000010100010111110001011100011100011111110011010000100100011111110101111111100100111101110110010101101100100111110010111011110101010001110101110111010101001001110101111111110101000101110010110011010101101001110101010110011101110001110101110001010101111111110010011011110100110101101000110001101111011001010110110010011 e7f0bbe3828be2e38fe6848febfc9eecad93e5dea8ebbaa4ebfea2e59ab4eab3b8eb8abfe4de9ad18decad93

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)