To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???伊??淫?????已???μ?癲??葵B 0011111100111111001111111000100011001001001111110011111110001000111110100011111100111111001111110011111100111111100110111101111100111111001111110011111110000011110010100011111111100001100111110011111100111111100010001010100001000010 3f3f3f88c93f3f88fa3f3f3f3f3f9bdf3f3f3f83ca3fe19f3f3f88a842
EUC-JP ???伊??淫?????已??洹μ?癲??葵B 00111111001111110011111110110000110010110011111100111111101100001111110000111111001111110011111100111111001111111101011011100001001111110011111110001111110001111011101010100110110011000011111111100010101000010011111100111111101100001010101001000010 3f3f3fb0cb3f3fb0fc3f3f3f3f3fd6e13f3f8fc7baa6cc3fe2a13f3fb0aa42
UTF-8 捻뀀맩伊됭돞淫딅꽧凉쏆슱已깁넫洹μ맯癲섍퉮葵B 111011111010011010100100111010111000000010000000111010111010011110101001111001001011110010001010111010111001000010101101111010111000111110011110111001101011011110101011111010111001010010000101111010101011110110100111111011111010010110111001111011001000111110000110111011001000101010110001111001011011011110110010111010101011100110000001111010111000010010101011111001101011010010111001110011101011110011101011101001111010111111100111100110011011001011101100100001001000110111101101100010011010111011101000100100011011010101000010 efa6a4eb8080eba7a9e4bc8aeb90adeb8f9ee6b7abeb9485eabda7efa5b9ec8f86ec8ab1e5b7b2eab981eb84abe6b4b9cebceba7afe799b2ec848ded89aee891b542
UHC 捻뀀맩伊됭돞淫딅꽧凉쏆슱已깁넫洹μ맯癲섍퉮葵B 111001101111011110110010111010111001000010110001111011001010010110001001111010001000100110100100111010111110001010001010111010111000010010110010111001011011110010011011111011001001101010111000111011001010101110110001111010011000011010101011111010101011011110100101111011001001000010110110111011111010011010011000111010101011100110000110110100001010110101000010 e6f7b2eb90b1eca589e889a4ebe28aeb84b2e5bc9bec9ab8ecabb1e986abeab7a5ec90b6efa698eab986d0ad42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)