To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 儼???業??艤?????畏?〕飮??議?? 1001100101010110001111110011111100111111100010111100011000111111001111111110010001111110001111110011111100111111001111110011111110001000110110000011111110000001011011001001111101011010001111110011111110001011011000110011111100111111 99563f3f3f8bc63f3fe47e3f3f3f3f3f88d83f816c9f5a3f3f8b633f3f
EUC-JP 儼???業??艤?????畏?〕飮??議?? 1101000110110111001111110011111100111111101101101100100000111111001111111110011111011111001111110011111100111111001111110011111110110000110110100011111110100001110011011101110110111011001111110011111110110101110001000011111100111111 d1b73f3f3fb6c83f3fe7df3f3f3f3f3fb0da3fa1cdddbb3f3fb5c43f3f
UTF-8 儼벿띤뜙業산낟艤앾쬇戮⑺닟畏븍〕飮닸콢議우묾 111001011000010010111100111010111011001010111111111010111001110110100100111010111001110010011001111001101010010110101101111011001000001010110000111010111000001010011111111010001000100110100100111011001001010110111110111011001010110010000111111011111010011110010010111000101001000110111010111010111000101110011111111001111001010110001111111010111011100010001101111000111000000010010101111010011010001110101110111010111000101110111000111011001011110110100010111010001010110110110000111011001001101010110000111010111010110010111110 e584bcebb2bfeb9da4eb9c99e6a5adec82b0eb829fe889a4ec95beecac87efa792e291baeb8b9fe7958febb88de38095e9a3aeeb8bb8ecbda2e8adb0ec9ab0ebacbe
UHC 儼벿띤뜙業산낟艤앾쬇戮⑺닟畏븍〕飮닸콢議우묾 1110010111110000100100111100111010110110111011011000110110011100111001011111011010111011111010101011001110101110111010111111101010011101111011111010011010011110111010111011110110101001111011011000100010011111111010001110011010111010111010111010000110110011111010111110011010110100111001101011000110011010111011001010000110111111111011001011100110110010 e5f093ceb6ed8d9ce5f6bbeab3aeebfa9defa69eebbda9ed889fe8e6baeba1b3ebe6b4e6b19aeca1bfecb9b2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)