To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鳶??擁??央??鳶??墺?????鳶??耶 1001001111001110001111110011111110010111011010010011111100111111100010011001101100111111001111111001001111001110001111110011111110011010110100100011111100111111001111110011111100111111100100111100111000111111001111111001011011101011 93ce3f3f97693f3f899b3f3f93ce3f3f9ad23f3f3f3f3f93ce3f3f96eb
EUC-JP 鳶??擁??央??鳶??墺?????鳶??耶 1100011011010000001111110011111111001101110010100011111100111111101100011111101100111111001111111100011011010000001111110011111111010100110101000011111100111111001111110011111100111111110001101101000000111111001111111100110011101101 c6d03f3fcdca3f3fb1fb3f3fc6d03f3fd4d43f3f3f3f3fc6d03f3fcced
UTF-8 鳶멩뿈擁녕떥央뉓떨鳶멨뎴墺든떥嶪뤺떨鳶멨뎴耶 111010011011001110110110111010111010100110101001111010111011111110001000111001101001001110000001111010111000010110010101111010111001011010100101111001011010010010101110111010111000100110010011111010111001011010101000111010011011001110110110111010111010100110101000111010111000111010110100111001011010001010111010111010111001001110100000111010111001011010100101111001011011011010101010111010111010010010111010111010111001011010101000111010011011001110110110111010111010100110101000111010111000111010110100111010001000000010110110 e9b3b6eba9a9ebbf88e69381eb8595eb96a5e5a4aeeb8993eb96a8e9b3b6eba9a8eb8eb4e5a2baeb93a0eb96a5e5b6aaeba4baeb96a8e9b3b6eba9a8eb8eb4e880b6
UHC 鳶멩뿈擁녕떥央뉓떨鳶멨뎴墺든떥嶪뤺떨鳶멨뎴耶 1110011011101001101110001110011010010111100011111110100010110110101100111110011110001011101110001110010011100111100001111110100010110110101100111110011011101001101110001110010110001001100001111110011111110010101101011110011110001011101110001110010111110101100011111110100010110110101100111110011011101001101110001110010110001001100001111110010110101101 e6e9b8e6978fe8b6b3e78bb8e4e787e8b6b3e6e9b8e58987e7f2b5e78bb8e5f58fe8b6b3e6e9b8e58987e5ad

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)