To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???違??臾??癒? 0011111100111111001111111000100011100001001111110011111111100100011010110011111100111111100101101111110000111111 3f3f3f88e13f3fe46b3f3f96fc3f
EUC-JP ???違??臾??癒? 0011111100111111001111111011000011100011001111110011111111100111110011000011111100111111110011001111111000111111 3f3f3fb0e33f3fe7cc3f3fccfe3f
UTF-8 黎싳뼃違띷퇍臾루독癒뀁 111011111010011010001001111011001000101110110011111010111011110010000011111010011000000110010101111010111001110110110111111011011000011110001101111010001000011110111110111010111010001110101000111010111000111110000101111001111001100110010010111010111000000010000001 efa689ec8bb3ebbc83e98195eb9db7ed878de887beeba3a8eb8f85e79992eb8081
UHC 黎싳뼃違띷퇍臾루독癒뀁 11100110101100011001101011101100100101101000110111101010110111101000110111100110101101111001111011101011101011001011011111100111101101011011011011101011101010001011001011101100 e6b19aec968deade8de6b79eebacb7e7b5b6eba8b2ec

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)