What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???d?????????d??????B 001111110011111100111111011001000011111100111111001111110011111100111111001111110011111100111111001111110110010000111111001111110011111100111111001111110011111101000010 3f3f3f643f3f3f3f3f3f3f3f3f643f3f3f3f3f3f42
SJIS-WIN 潁〓?d???筌??潁〓?d???筌??B 100111111111000110000001101011000011111101100100001111110011111100111111111000101010001100111111001111111001111111110001100000011010110000111111011001000011111100111111001111111110001010100011001111110011111101000010 9ff181ac3f643f3f3fe2a33f3f9ff181ac3f643f3f3fe2a33f3f42
EUC-JP 潁〓?d???筌??潁〓?d???筌??B 110111101111001110100010101011100011111101100100001111110011111100111111111001001010010100111111001111111101111011110011101000101010111000111111011001000011111100111111001111111110010010100101001111110011111101000010 def3a2ae3f643f3f3fe4a53f3fdef3a2ae3f643f3f3fe4a53f3f42
UTF-8 潁〓젙d凉붾졁筌잙젾潁〓젙d凉붾졁筌잙젾B 111001101011110110000001111000111000000010010011111011001010000010011001011001001110111110100101101110011110101110110110101111101110110010100001100000011110011110101101100011001110110010011110100110011110110010100000101111101110011010111101100000011110001110000000100100111110110010100000100110010110010011101111101001011011100111101011101101101011111011101100101000011000000111100111101011011000110011101100100111101001100111101100101000001011111001000010 e6bd81e38093eca09964efa5b9ebb6beeca181e7ad8cec9e99eca0bee6bd81e38093eca09964efa5b9ebb6beeca181e7ad8cec9e99eca0be42
UHC 潁〓젙d凉붾졁筌잙젾潁〓젙d凉붾졁筌잙젾B 111001111011100010100001111010111010000010010101011001001110010110111100100101001110101110100000101100101110111110100111100111111110101110100000101100001110011110111000101000011110101110100000100101010110010011100101101111001001010011101011101000001011001011101111101001111001111111101011101000001011000001000010 e7b8a1eba09564e5bc94eba0b2efa79feba0b0e7b8a1eba09564e5bc94eba0b2efa79feba0b042

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
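
The conversion shown in the table can be approximated offline. The following is a minimal sketch, not the code behind this page: the function names `encode_row` and `encode_table` are hypothetical, and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to be what the table does as well.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hex string) for text encoded with codec."""
    # errors="replace" substitutes b"?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset and return {label: (bits, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("dB").items():
        print(label, bits, hex_string)
```

For plain ASCII input such as `B`, every charset listed here yields the same single byte, `01000010` (hex `42`), which matches the final byte of each row in the table above.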