To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲?8踰??攸??汝??臾??域??唯?┃ 111000011001111100111111100000100101011111100110111110100011111100111111100111011011111100111111001111111001001111110000001111110011111111100100011010110011111100111111100010001110011000111111001111111001011101000010001111111000010010101011 e19f3f8257e6fa3f3f9dbf3f3f93f03f3fe46b3f3f88e63f3f97423f84ab
EUC-JP 癲?8踰??攸??汝??臾??域??唯?┃ 111000101010000100111111101000111011100011101100111111000011111100111111110110101100000100111111001111111100011011110010001111110011111111100111110011000011111100111111101100001110100000111111001111111100110110100011001111111010100010101101 e2a13fa3b8ecfc3f3fdac13f3fc6f23f3fe7cc3f3fb0e83f3fcda33fa8ad
UTF-8 癲쒕8踰딂굢攸곸냸汝븃짆臾딅쇀域㏓슢唯롩┃ 111001111001100110110010111011001001001010010101111011111011110010011000111010001011100010110000111010111001010010000010111010101011010110100010111001101001010010111000111010101011001110111000111010111000001110111000111001101011000110011101111010111011100010000011111011001010011110000110111010001000011110111110111010111001010010000101111011001000011110000000111001011001111110011111111000111000111110010011111011001000101010100010111001011001010010101111111010111010000110101001111000101001010010000011 e799b2ec9295efbc98e8b8b0eb9482eab5a2e694b8eab3b8eb83b8e6b19debb883eca786e887beeb9485ec8780e59f9fe38f93ec8aa2e594afeba1a9e29483
UHC 癲쒕8踰딂굢攸곸냸汝븃짆臾딅쇀域㏓슢唯롩┃ 111011111010011010011100111010111010001110111000111010111011001010001010111010001000001010001001111010101111001010000001111011001000011010001000111001101010001110111010111010001010001110010101111010111010110010001010111010111001100110110100111001101011010010100111111010111001101010101110111010101110011010001110111010011010011010101101 efa69ceba3b8ebb28ae88289eaf281ec8688e6a3bae8a395ebac8aeb99b4e6b4a7eb9aaeeae68ee9a6ad

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)