To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????[BF 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111010110110100001001000110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5b4246
SJIS-WIN 鳶??墺??央??鳶??墺??央??[BF 100100111100111000111111001111111001101011010010001111110011111110001001100110110011111100111111100100111100111000111111001111111001101011010010001111110011111110001001100110110011111100111111010110110100001001000110 93ce3f3f9ad23f3f899b3f3f93ce3f3f9ad23f3f899b3f3f5b4246
EUC-JP 鳶??墺??央??鳶??墺??央??[BF 110001101101000000111111001111111101010011010100001111110011111110110001111110110011111100111111110001101101000000111111001111111101010011010100001111110011111110110001111110110011111100111111010110110100001001000110 c6d03f3fd4d43f3fb1fb3f3fc6d03f3fd4d43f3fb1fb3f3f5b4246
UTF-8 鳶멨뎴墺든떥央뉏퓱鳶멨뎴墺든떥央뉓떨[BF 111010011011001110110110111010111010100110101000111010111000111010110100111001011010001010111010111010111001001110100000111010111001011010100101111001011010010010101110111010111000100110001111111011011001001110110001111010011011001110110110111010111010100110101000111010111000111010110100111001011010001010111010111010111001001110100000111010111001011010100101111001011010010010101110111010111000100110010011111010111001011010101000010110110100001001000110 e9b3b6eba9a8eb8eb4e5a2baeb93a0eb96a5e5a4aeeb898fed93b1e9b3b6eba9a8eb8eb4e5a2baeb93a0eb96a5e5a4aeeb8993eb96a85b4246
UHC 鳶멨뎴墺든떥央뉏퓱鳶멨뎴墺든떥央뉓떨[BF 111001101110100110111000111001011000100110000111111001111111001010110101111001111000101110111000111001001110011110000111111001001011111110010111111001101110100110111000111001011000100110000111111001111111001010110101111001111000101110111000111001001110011110000111111010001011011010110011010110110100001001000110 e6e9b8e58987e7f2b5e78bb8e4e787e4bf97e6e9b8e58987e7f2b5e78bb8e4e787e8b6b35b4246

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)