To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????B 00111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f42
SJIS-WIN 瘟??弱?オ鸚??B 1110000110001001001111110011111110001110111000110011111110000011010010011110101001011111001111110011111101000010 e1893f3f8ee33f8349ea5f3f3f42
EUC-JP 瘟??弱?オ鸚??B 1110000111101001001111110011111110111100111001010011111110100101101010101111001111000000001111110011111101000010 e1e93f3fbce53fa5aaf3c03f3f42
UTF-8 瘟룝많弱딁オ鸚싮콌B 11100111100110001001111111101011101000111001110111101011101001111000111011100101101111001011000111101011100101001000000111100011100000101010101011101001101110001001101011101100100010111010111011101100101111011000110001000010 e7989feba39deba78ee5bcb1eb9481e382aae9b89aec8baeecbd8c42
UHC 瘟룝많弱딁オ鸚싮콌B 11101000101100001011011111100100101110001011100111100101101100001000101011100111101010111010101011100101101001001001101011101001101100011000100001000010 e8b0b7e4b8b9e5b08ae7abaae5a49ae9b18842

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)