To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????B 00111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???軟??譽??B 001111110011111100111111100100111110111000111111001111111110011010100011001111110011111101000010 3f3f3f93ee3f3fe6a33f3f42
EUC-JP 獒??軟??譽??B 1000111111001011101110110011111100111111110001101111000000111111001111111110110010100101001111110011111101000010 8fcbbb3f3fc6f03f3feca53f3f42
UTF-8 獒앲꽋軟⑵짔譽쏁뀳B 11100111100011011001001011101100100101011011001011101010101111011000101111101000101110111001111111100010100100011011010111101100101001111001010011101000101011011011110111101100100011111000000111101011100000001011001101000010 e78d92ec95b2eabd8be8bb9fe291b5eca794e8adbdec8f81eb80b342
UHC 獒앲꽋軟⑵짔譽쏁뀳B 11101000101000111001110111101000100001001001101111100110111000111010100111101000101000111001110111100111111000101001101111100111100001011010100101000010 e8a39de8849be6e3a9e8a39de7e29be785a942

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)