To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 暗????ぜ轅??筌??誼?????暗??? 10001000110000110011111100111111001111110011111110000010101110101110011101110110001111110011111111100010101000110011111100111111100010110110001000111111001111110011111100111111001111111000100011000011001111110011111100111111 88c33f3f3f3f82bae7763f3fe2a33f3f8b623f3f3f3f3f88c33f3f3f
EUC-JP 暗??佾?ぜ轅??筌??誼?????暗??? 101100001100010100111111001111111000111110110000111110110011111110100100101111001110110111010111001111110011111111100100101001010011111100111111101101011100001100111111001111110011111100111111001111111011000011000101001111110011111100111111 b0c53f3f8fb0fb3fa4bcedd73f3fe4a53f3fb5c33f3f3f3f3fb0c53f3f3f
UTF-8 暗삳쉰佾볢ぜ轅대븸筌먲퐣誼욃냶栒삳샍暗싎띿춷 111001101001101010010111111011001000001010110011111011001000100110110000111001001011110110111110111010111011001110100010111000111000000110011100111010001011110110000101111010111000110010000000111010111011100010111000111001111010110110001100111010111010100010110010111011011001000010100011111010001010101010111100111011001001101010000011111010111000001110110110111001101010000010010010111011001000001010110011111011001000001110001101111001101001101010010111111011001000101110001110111010111001110110111111111011001011011010110111 e69a97ec82b3ec89b0e4bdbeebb3a2e3819ce8bd85eb8c80ebb8b8e7ad8ceba8b2ed90a3e8aabcec9a83eb83b6e6a092ec82b3ec838de69a97ec8b8eeb9dbfecb6b7
UHC 暗삳쉰佾볢ぜ轅대븸筌먲퐣誼욃냶栒삳샍暗싎띿춷 1110010011011110101110111110101110111101101011101110110011101011100100111110100010101010101111001110101010111111101101001110101110010101101000011110111110100111100100001110111110111101100011001110101111111110100111101110010110000110100001101110001011100011101110111110101110011000101110111110010011011110100110101101000110001101111011001010110110010011 e4debbebbdaeeceb93e8aabceabfb4eb95a1efa790efbd8cebfe9ee58686e2e3bbeb98bbe4de9ad18decad93

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)