What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????}v?????????}vB 0011111100111111001111110011111100111111001111110011111100111111001111110111110101110110001111110011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f3f7d7642
SJIS-WIN 癲ル?淫??兪??}v癲ル?淫??兪??}vB 11100001100111111000001110001011001111111000100011111010001111110011111110011001011000000011111100111111011111010111011011100001100111111000001110001011001111111000100011111010001111110011111110011001011000000011111100111111011111010111011001000010 e19f838b3f88fa3f3f99603f3f7d76e19f838b3f88fa3f3f99603f3f7d7642
EUC-JP 癲ル?淫??兪??}v癲ル?淫??兪??}vB 11100010101000011010010111101011001111111011000011111100001111110011111111010001110000010011111100111111011111010111011011100010101000011010010111101011001111111011000011111100001111110011111111010001110000010011111100111111011111010111011001000010 e2a1a5eb3fb0fc3f3fd1c13f3f7d76e2a1a5eb3fb0fc3f3fd1c13f3f7d7642
UTF-8 癲ル슪淫졿뵺兪껊쭓}v癲ル슪淫졿뵺兪껊쭓}vB 1110011110011001101100101110001110000011101010111110110010001010101010101110011010110111101010111110110010100001101111111110101110110101101110101110010110000101101010101110101010111011100010101110110010101101100100110111110101110110111001111001100110110010111000111000001110101011111011001000101010101010111001101011011110101011111011001010000110111111111010111011010110111010111001011000010110101010111010101011101110001010111011001010110110010011011111010111011001000010 e799b2e383abec8aaae6b7abeca1bfebb5bae585aaeabb8aecad937d76e799b2e383abec8aaae6b7abeca1bfebb5bae585aaeabb8aecad937d7642
UHC 癲ル슪淫졿뵺兪껊쭓}v癲ル슪淫졿뵺兪껊쭓}vB 1110111110100110101010111110101110011010101100111110101111100010101000001110011010010100101110001110101011100100100000111110101110100111100010110111110101110110111011111010011010101011111010111001101010110011111010111110001010100000111001101001010010111000111010101110010010000011111010111010011110001011011111010111011001000010 efa6abeb9ab3ebe2a0e694b8eae483eba78b7d76efa6abeb9ab3ebe2a0e694b8eae483eba78b7d7642
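
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The codec names (`cp932` for SJIS-WIN, `euc_jp`, `cp949` for UHC) are Python's closest standard equivalents and are an assumption, as is the helper name `encode_table`. Characters that a charset cannot represent are replaced with `?` (0x3f), which resembles the ISO-8859-1 row above. Exact bytes may differ from this page for some characters.

```python
# Map the labels used in the table to Python codec names (assumed equivalents).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, binary, data.hex()))
    return rows


if __name__ == "__main__":
    for label, binary, hex_string in encode_table("あ"):
        print(label, binary, hex_string)
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column.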

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).