What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 儼??泣?┸???儼??泣?┸???^ 10011001010101100011111100111111100010111000001100111111100001001011110100111111001111110011111110011001010101100011111100111111100010111000001100111111100001001011110100111111001111110011111101011110 99563f3f8b833f84bd3f3f3f99563f3f8b833f84bd3f3f3f5e
EUC-JP 儼??泣?┸洧??儼??泣?┸洧??^ 1101000110110111001111110011111110110101111000110011111110101000101111111000111111000111101101000011111100111111110100011011011100111111001111111011010111100011001111111010100010111111100011111100011110110100001111110011111101011110 d1b73f3fb5e33fa8bf8fc7b43f3fd1b73f3fb5e33fa8bf8fc7b43f3f5e
UTF-8 儼볥톪泣됵┸洧룹툗儼볥톪泣됵┸洧룸연^ 11100101100001001011110011101011101100111010010111101101100001101010101011100110101100111010001111101011100100001011010111100010100101001011100011100110101101001010011111101011101000111011100111101101100010001001011111100101100001001011110011101011101100111010010111101101100001101010101011100110101100111010001111101011100100001011010111100010100101001011100011100110101101001010011111101011101000111011100011101100100101111011000001011110 e584bcebb3a5ed86aae6b3a3eb90b5e294b8e6b4a7eba3b9ed8897e584bcebb3a5ed86aae6b3a3eb90b5e294b8e6b4a7eba3b8ec97b05e
UHC 儼볥톪泣됵┸洧룹툗儼볥톪泣됵┸洧룸연^ 11100101111100001001001111101011101101111000001011101011111010001000100111101111101001101011111111101010111110111011011111101100101110001000111011100101111100001001001111101011101101111000001011101011111010001000100111101111101001101011111111101010111110111011011111101011101111111010110001011110 e5f093ebb782ebe889efa6bfeafbb7ecb88ee5f093ebb782ebe889efa6bfeafbb7ebbfac5e

SJIS-Win, EUC-JP: Legacy character sets used mainly for encoding Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.
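
The kind of conversion shown in the table can be approximated with a short Python sketch. This is not the page's own implementation. The helper name `encode_row` and the mapping from table labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions made for illustration. The sketch also assumes that characters a charset cannot represent are replaced with `?` (0x3f), which is consistent with the runs of `3f` in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win = CP932, as the note says
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to Python's cp949
}


def encode_row(text, codec):
    """Hypothetical helper: return (binary, hexadecimal) strings for text."""
    # Characters the charset cannot represent are replaced with "?" (0x3f).
    raw = text.encode(codec, errors="replace")
    # Each byte is written as exactly eight binary digits.
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


# Print one row per charset for a sample input.
for label, codec in CHARSETS.items():
    binary, hexadecimal = encode_row("あ^", codec)
    print(label, binary, hexadecimal)
```

Calling `encode_row("^", "utf-8")` returns `("01011110", "5e")`, matching the trailing `5e` byte in every row of the table.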