To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???徇??怨??癲ぱ??筌?∥誼?┨ 001111110011111100111111100111000110110100111111001111111000100110000101001111110011111111100001100111111000001011001111001111110011111111100010101000110011111110000001011000011000101101100010001111111000010010110111 3f3f3f9c6d3f3f89853f3fe19f82cf3f3fe2a33f81618b623f84b7
EUC-JP ???徇??怨??癲ぱ??筌?‖誼?┨ 001111110011111100111111110101111100111000111111001111111011000111100101001111110011111111100010101000011010010011010001001111110011111111100100101001010011111110100001110000101011010111000011001111111010100010111001 3f3f3fd7ce3f3fb1e53f3fe2a1a4d13f3fe4a53fa1c2b5c33fa8b9
UTF-8 囹덈슢徇됵쭓怨뺤졅癲ぱ놁퐡筌뗫∥誼뷂┨ 111011111010011010101001111010111000110110001000111011001000101010100010111001011011111010000111111010111001000010110101111011001010110110010011111001101000000010101000111010111011101010100100111011001010000110000101111001111001100110110010111000111000000110110001111010111000011010000001111011011001000010100001111001111010110110001100111010111001011110101011111000101000100010100101111010001010101010111100111010111011011110000010111000101001010010101000 efa6a9eb8d88ec8aa2e5be87eb90b5ecad93e680a8ebbaa4eca185e799b2e381b1eb8681ed90a1e7ad8ceb97abe288a5e8aabcebb782e294a8
UHC 囹덈슢徇됵쭓怨뺤졅癲ぱ놁퐡筌뗫∥誼뷂┨ 1110011110101010100010001110101110011010101011101110001011011111100010011110111110100111100010111110101010110011100101011110110010100000101101101110111110100110101010101101000110000110111011001011110110001010111011111010011110001011111010111010000110101011111010111111111010010100111011111010011010111001 e7aa88eb9aaee2df89efa78beab395eca0b6efa6aad186ecbd8aefa78beba1abebfe94efa6b9

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)