What bit string is a character (or short string) encoded as in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ??ビ?日???ビ?酉??ビ?日???ビ?酉B 00111111001111111000001101110010001111111001001111111010001111110011111100111111100000110111001000111111100100111101000100111111001111111000001101110010001111111001001111111010001111110011111100111111100000110111001000111111100100111101000101000010 3f3f83723f93fa3f3f3f83723f93d13f3f83723f93fa3f3f3f83723f93d142
EUC-JP ??ビ?日???ビ?酉??ビ?日???ビ?酉B 00111111001111111010010111010011001111111100011011111100001111110011111100111111101001011101001100111111110001101101001100111111001111111010010111010011001111111100011011111100001111110011111100111111101001011101001100111111110001101101001101000010 3f3fa5d33fc6fc3f3f3fa5d33fc6d33f3fa5d33fc6fc3f3f3fa5d33fc6d342
UTF-8 룶핊ビ룫日⒟룶핊ビ룫酉룶핊ビ룫日⒟룶핊ビ룫酉B 11101011101000111011011011101101100101011000101011100011100000111001001111101011101000111010101111100110100101111010010111100010100100101001111111101011101000111011011011101101100101011000101011100011100000111001001111101011101000111010101111101001100001011000100111101011101000111011011011101101100101011000101011100011100000111001001111101011101000111010101111100110100101111010010111100010100100101001111111101011101000111011011011101101100101011000101011100011100000111001001111101011101000111010101111101001100001011000100101000010 eba3b6ed958ae38393eba3abe697a5e2929feba3b6ed958ae38393eba3abe98589eba3b6ed958ae38393eba3abe697a5e2929feba3b6ed958ae38393eba3abe9858942
UHC 룶핊ビ룫日⒟룶핊ビ룫酉룶핊ビ룫日⒟룶핊ビ룫酉B 100011111010101111000000100011111010101111010011100011111010001011101100111011011010100111010000100011111010101111000000100011111010101111010011100011111010001011101011101101111000111110101011110000001000111110101011110100111000111110100010111011001110110110101001110100001000111110101011110000001000111110101011110100111000111110100010111010111011011101000010 8fabc08fabd38fa2eceda9d08fabc08fabd38fa2ebb78fabc08fabd38fa2eceda9d08fabc08fabd38fa2ebb742

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
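
The conversion shown in the table can be approximated with a few lines of Python. This is only a sketch, not the tool's actual implementation: the function name `encode_report` is hypothetical, and the mapping from the tool's charset labels to Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) is an assumption that may differ from the tool in edge cases. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the "?" entries in the table.

```python
# Assumed mapping from the tool's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_report(text, charsets=CHARSETS):
    """Return (label, binary string, hex string) for text in each charset."""
    rows = []
    for label, codec in charsets.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexstr in encode_report("日B"):
        print(label, bits, hexstr)
```

For the input "日B", the UTF-8 row comes out as e697a542 and the EUC-JP row as c6fc42, which agrees with the byte sequences for 日 and B that appear in the table.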