What bit string is a character (or a short string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲ル?淫??袁る?癲ル?揖??宋?????肄 1110000110011111100000111000101100111111100010001111101000111111001111111110010111001101100000101110100100111111111000011001111110000011100010110011111110010111010010110011111100111111100100010111011000111111001111110011111100111111001111111110001111100101 e19f838b3f88fa3f3fe5cd82e93fe19f838b3f974b3f3f91763f3f3f3f3fe3e5
EUC-JP 癲ル?淫??袁る?癲ル?揖??宋?????肄 1110001010100001101001011110101100111111101100001111110000111111001111111110101011001111101001001110101100111111111000101010000110100101111010110011111111001101101011000011111100111111110000011101011100111111001111110011111100111111001111111110011011100111 e2a1a5eb3fb0fc3f3feacfa4eb3fe2a1a5eb3fcdac3f3fc1d73f3f3f3f3fe6e7
UTF-8 癲ル슪淫졿뵺袁る쑂癲ル슣揖띈린宋묎턀醴븐슙肄 111001111001100110110010111000111000001110101011111011001000101010101010111001101011011110101011111011001010000110111111111010111011010110111010111010001010001010000001111000111000001010001011111011001001000110000010111001111001100110110010111000111000001110101011111011001000101010100011111001101000111110010110111010111001110110001000111010111010011010110000111001011010111010001011111010111010110010001110111011011000010010000000111011111010011010110111111010111011100010010000111011001000101010011001111010001000001010000100 e799b2e383abec8aaae6b7abeca1bfebb5bae8a281e3828bec9182e799b2e383abec8aa3e68f96eb9d88eba6b0e5ae8bebac8eed8480efa6b7ebb890ec8a99e88284
UHC 癲ル슪淫졿뵺袁る쑂癲ル슣揖띈린宋묎턀醴븐슙肄 1110111110100110101010111110101110011010101100111110101111100010101000001110011010010100101110001110101010111110101010101110101110011100101000101110111110100110101010111110101110011010101011111110101111100111101101101110100010111000101100001110000111100100100100011110101010110101100111001110011111100100101110101110110010011010101001111110110010111101 efa6abeb9ab3ebe2a0e694b8eabeaaeb9ca2efa6abeb9aafebe7b6e8b8b0e1e491eab59ce7e4baec9aa7ecbd

SJIS-Win, EUC-JP: Legacy character sets used mainly for encoding Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
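
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the page's actual implementation. The mapping from the page's charset labels to Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) and the helper names `encode_row` and `encode_table` are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (byte 3f), which appears to be what the ISO-8859-1 row above shows.

```python
# Assumed mapping from the page's charset labels to Python codec names.
# The page itself may use a different converter with slightly different tables.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' (0x3f) instead of raising.
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte, zero-padded
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("あ").items():
        print(label, bits, hex_string)
```

For example, `encode_row("あ", "utf-8")` yields the hex string `e38182`, while `encode_row("あ", "latin-1")` yields `3f` because ISO-8859-1 has no hiragana.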