What bit string is a character (or short string) encoded as in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?手邯??モ?日も? 001111111000111011101000111001111011011000111111001111111000001110000010001111111001001111111010100000101110000000111111 3f8ee8e7b63f3f83823f93fa82e03f
EUC-JP ?手邯??モ?日も? 001111111011110011101010111011101011100000111111001111111010010111100010001111111100011011111100101001001110001000111111 3fbceaeeb83f3fa5e23fc6fca4e23f
UTF-8 룶手邯룫콓モ룫日も룶 111010111010001110110110111001101000100110001011111010011000001010101111111010111010001110101011111011001011110110010011111000111000001110100010111010111010001110101011111001101001011110100101111000111000001010000010111010111010001110110110 eba3b6e6898be982afeba3abecbd93e383a2eba3abe697a5e38282eba3b6
UHC 룶手邯룫콓モ룫日も룶 1000111110101011111000101010001011001010111110111000111110100010101100011000111110101011111000101000111110100010111011001110110110101010111000101000111110101011 8fabe2a2cafb8fa2b18fabe28fa2ecedaae28fab

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
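
The conversion shown in the table can be sketched in a few lines of Python. This is only an illustration, not the code behind this page. It assumes that Python's codec names `latin-1`, `cp932`, `euc_jp`, `utf-8`, and `cp949` correspond to the charset labels ISO-8859-1, SJIS-WIN, EUC-JP, UTF-8, and UHC, and that a character a charset cannot represent is replaced with `?` (byte 0x3f), as the `3f` bytes in the table suggest. The names `CODECS`, `encode_row`, and `encode_table` are hypothetical helpers introduced here.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    # Each byte becomes 8 binary digits, zero-padded on the left.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("手").items():
        print(label, bits, hex_string)
```

Results for edge cases may differ from this page, because converters do not always agree on how vendor extensions and unmappable characters are handled.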