To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??宜??揄?????維??淫??榮?? 111000011001111100111111001111111000101101011000001111110011111110011101100010010011111100111111001111110011111100111111100010001101101100111111001111111000100011111010001111110011111110011110110001000011111100111111 e19f3f3f8b583f3f9d893f3f3f3f3f88db3f3f88fa3f3f9ec43f3f
EUC-JP 癲??宜??揄?????維??淫??榮?? 111000101010000100111111001111111011010110111001001111110011111111011001111010010011111100111111001111110011111100111111101100001101110100111111001111111011000011111100001111110011111111011100110001100011111100111111 e2a13f3fb5b93f3fd9e93f3f3f3f3fb0dd3f3fb0fc3f3fdcc63f3f
UTF-8 癲덈챶宜룬씘揄우물醴븐뼦維쏉쬊淫뚮눀榮싷쭎 111001111001100110110010111010111000110110001000111011001011000110110110111001011010111010011100111010111010001110101100111011001001010010011000111001101000111110000100111011001001101010110000111010111010110010111100111011111010011010110111111010111011100010010000111010111011110010100110111001111011011010101101111011001000111110001001111011001010110010001010111001101011011110101011111010111001101010101110111010111000100010000000111001101010011010101110111011001000101110110111111011001010110110001110 e799b2eb8d88ecb1b6e5ae9ceba3acec9498e68f84ec9ab0ebacbcefa6b7ebb890ebbca6e7b6adec8f89ecac8ae6b7abeb9aaeeb8880e6a6aeec8bb7ecad8e
UHC 癲덈챶宜룬씘揄우물醴븐뼦維쏉쬊淫뚮눀榮싷쭎 111011111010011010001000111010111010101010000011111010111111000110110111111010011001110110101101111010101111000110111111111011001011100110110000111001111110010010111010111011001001011010101001111010111010101110011011111011111010011010100000111010111110001010001100111010111000011110100001111001111011010010011010111011111010011110000111 efa688ebaa83ebf1b7e99dadeaf1bfecb9b0e7e4baec96a9ebab9befa6a0ebe28ceb87a1e7b49aefa787

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)