To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鵝??肄??淫??域??悠??惟???耳 111010100100000000111111001111111110001111100101001111110011111110001000111110100011111100111111100010001110011000111111001111111001011101001001001111110011111110001000110100100011111100111111001111111000111010101000 ea403f3fe3e53f3f88fa3f3f88e63f3f97493f3f88d23f3f3f8ea8
EUC-JP 鵝??肄??淫??域??悠??惟???耳 111100111010000100111111001111111110011011100111001111110011111110110000111111000011111100111111101100001110100000111111001111111100110110101010001111110011111110110000110101000011111100111111001111111011110010101010 f3a13f3fe6e73f3fb0fc3f3fb0e83f3fcdaa3f3fb0d43f3f3fbcaa
UTF-8 鵝숈뮆肄덃끽淫딇닞域밟뫁悠⑼쬇惟걔뷴첎耳 111010011011010110011101111011001000100010001000111010111010111010000110111010001000001010000100111010111000110110000011111010111000000110111101111001101011011110101011111010111001010010000111111010111000101110011110111001011001111110011111111010111011000010011111111010111010101110000001111001101000001010100000111000101001000110111100111011001010110010000111111001101000001110011111111010101011000110010100111010111011011110110100111011001011001010001110111010001000000010110011 e9b59dec8888ebae86e88284eb8d83eb81bde6b7abeb9487eb8b9ee59f9febb09febab81e682a0e291bcecac87e6839feab194ebb7b4ecb28ee880b3
UHC 鵝숈뮆肄덃끽淫딇닞域밟뫁悠⑼쬇惟걔뷴첎耳 11100100101111011001100111101100100100101001010111101100101111011000100011100110101100111010001111101011111000101000101011101101100010001001111011100110101101001011100111100010100100011010010111101010111011011010100111101111101001101001111011101010111011101011000011000010101110101110010110101010100110111110110010111100 e4bd99ec9295ecbd88e6b3a3ebe28aed889ee6b4b9e291a5eaeda9efa69eeaeeb0c2bae5aa9becbc

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)