Which bit string is a character (or short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???徇??怨??癲??誼??急???怨??夜 001111110011111100111111100111000110110100111111001111111000100110000101001111110011111111100001100111110011111100111111100010110110001000111111001111111000101101111101001111110011111100111111100010011000010100111111001111111001011011101001 3f3f3f9c6d3f3f89853f3fe19f3f3f8b623f3f8b7d3f3f3f89853f3f96e9
EUC-JP ???徇??怨??癲??誼??急薏??怨??夜 0011111100111111001111111101011111001110001111110011111110110001111001010011111100111111111000101010000100111111001111111011010111000011001111110011111110110101110111101000111111011001110111100011111100111111101100011110010100111111001111111100110011101011 3f3f3fd7ce3f3fb1e53f3fe2a13f3fb5c33f3fb5de8fd9de3f3fb1e53f3fcceb
UTF-8 囹덈슢徇됵쭓怨뺤졅癲쀬빖誼요튋急薏껓쭓怨뺤젴夜 111011111010011010101001111010111000110110001000111011001000101010100010111001011011111010000111111010111001000010110101111011001010110110010011111001101000000010101000111010111011101010100100111011001010000110000101111001111001100110110010111011001000000010101100111010111011100110010110111010001010101010111100111011001001101010010100111011011000101010001011111001101000000010100101111010001001011010001111111010101011101110010011111011001010110110010011111001101000000010101000111010111011101010100100111011001010000010110100111001011010010010011100 efa6a9eb8d88ec8aa2e5be87eb90b5ecad93e680a8ebbaa4eca185e799b2ec80acebb996e8aabcec9a94ed8a8be680a5e8968feabb93ecad93e680a8ebbaa4eca0b4e5a49c
UHC 囹덈슢徇됵쭓怨뺤졅癲쀬빖誼요튋急薏껓쭓怨뺤젴夜 11100111101010101000100011101011100110101010111011100010110111111000100111101111101001111000101111101010101100111001010111101100101000001011011011101111101001101001011111101100100101011011100011101011111111101011111111100100101110011001111111010000111000011110101111111011100000111110111110100111100010111110101010110011100101011110110010100000101010001110010110101000 e7aa88eb9aaee2df89efa78beab395eca0b6efa697ec95b8ebfebfe4b99fd0e1ebfb83efa78beab395eca0a8e5a8

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
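
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The mapping from the table's charset labels to Python codec names (`CODECS`) and the helper names `encode_row` and `encode_table` are assumptions made for illustration. It also assumes that characters a charset cannot represent are replaced with `?` (byte 3f), which matches the ISO-8859-1 row above.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
# SJIS-WIN is treated as CP932 and UHC as CP949.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in one codec."""
    # errors="replace" turns unencodable characters into '?' (byte 0x3f).
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("夜").items():
        print(label, bits, hex_string)
```

For a single character such as 夜, each charset yields a different byte sequence, and a charset with no code for the character falls back to the one-byte `?` substitute.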