To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲???筌??揖??魏?????溢??淫?? 11100001100111110011111100111111001111111110001010100011001111110011111110010111010010110011111100111111111010011011000000111111001111110011111100111111001111111000100011101100001111110011111110001000111110100011111100111111 e19f3f3f3fe2a33f3f974b3f3fe9b03f3f3f3f3f88ec3f3f88fa3f3f
EUC-JP 癲???筌??揖??魏?????溢??淫?? 11100010101000010011111100111111001111111110010010100101001111110011111111001101101011000011111100111111111100101011001000111111001111110011111100111111001111111011000011101110001111110011111110110000111111000011111100111111 e2a13f3f3fe4a53f3fcdac3f3ff2b23f3f3f3f3fb0ee3f3fb0fc3f3f
UTF-8 癲쑳살뵢筌묒뇯揖먨뭣魏뉖츏嶺뚮뿰溢졿뇡淫볦춷 111001111001100110110010111011001001000110110011111011001000001010110100111010111011010110100010111001111010110110001100111010111010110010010010111010111000011110101111111001101000111110010110111010111010100010101000111010111010110110100011111010011010110110001111111010111000100110010110111011001011100010001111111011111010011010101011111010111001101010101110111010111011111110110000111001101011101010100010111011001010000110111111111010111000011110100001111001101011011110101011111010111011001110100110111011001011011010110111 e799b2ec91b3ec82b4ebb5a2e7ad8cebac92eb87afe68f96eba8a8ebada3e9ad8feb8996ecb88fefa6abeb9aaeebbfb0e6baa2eca1bfeb87a1e6b7abebb3a6ecb6b7
UHC 癲쑳살뵢筌묒뇯揖먨뭣魏뉖츏嶺뚮뿰溢졿뇡淫볦춷 1110111110100110100111001100111010111011111011001001010010100010111011111010011110010001111011001000011110010100111010111110011110010000111001011011100110111101111010101110000010000111111010111010111010001010111001111010110110001100111010111001011110110000111011001110111010100000111001101000011110001001111010111110001010010011111011001010110110010011 efa69ccebbec94a2efa791ec8794ebe790e5b9bdeae087ebae8ae7ad8ceb97b0eceea0e68789ebe293ecad93

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)