To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 嶸ヲ巐常丿巐卓アオ巐】嶸ヲ巐常丿巐卓アオ巐】B 111110101011010010100110111110101011011010001111111011011001100010100110111110101011011010010001111011001011000110110101111110101011011010000001011110101111101010110100101001101111101010110110100011111110110110011000101001101111101010110110100100011110110010110001101101011111101010110110100000010111101001000010 fab4a6fab68fed98a6fab691ecb1b5fab6817afab4a6fab68fed98a6fab691ecb1b5fab6817a42
EUC-JP 嶸ヲ巐常丿巐卓アオ巐】嶸ヲ巐常丿巐卓アオ巐】B 1000111110111011111101001000111010100110100011111011101111111001101111101110111111010000101010001000111110111011111110011100001011101110100011101011000110001110101101011000111110111011111110011010000111011011100011111011101111110100100011101010011010001111101110111111100110111110111011111101000010101000100011111011101111111001110000101110111010001110101100011000111010110101100011111011101111111001101000011101101101000010 8fbbf48ea68fbbf9beefd0a88fbbf9c2ee8eb18eb58fbbf9a1db8fbbf48ea68fbbf9beefd0a88fbbf9c2ee8eb18eb58fbbf9a1db42
UTF-8 嶸ヲ巐常丿巐卓アオ巐】嶸ヲ巐常丿巐卓アオ巐】B 11100101101101101011100011101111101111011010011011100101101101111001000011100101101110001011100011100100101110001011111111100101101101111001000011100101100011011001001111101111101111011011000111101111101111011011010111100101101101111001000011100011100000001001000111100101101101101011100011101111101111011010011011100101101101111001000011100101101110001011100011100100101110001011111111100101101101111001000011100101100011011001001111101111101111011011000111101111101111011011010111100101101101111001000011100011100000001001000101000010 e5b6b8efbda6e5b790e5b8b8e4b8bfe5b790e58d93efbdb1efbdb5e5b790e38091e5b6b8efbda6e5b790e5b8b8e4b8bfe5b790e58d93efbdb1efbdb5e5b790e3809142
UHC 嶸??常??卓???】嶸??常??卓???】B 11100111101011100011111100111111110111111100100000111111001111111111011011110001001111110011111100111111101000011011110111100111101011100011111100111111110111111100100000111111001111111111011011110001001111110011111100111111101000011011110101000010 e7ae3f3fdfc83f3ff6f13f3f3fa1bde7ae3f3fdfc83f3ff6f13f3f3fa1bd42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)