What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ?????????? | 00111111001111110011111100111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f3f3f3f |
| SJIS-WIN | ???猶??揄??敖 | 00111111001111110011111110010111010100000011111100111111100111011000100100111111001111111001110111000010 | 3f3f3f97503f3f9d893f3f9dc2 |
| EUC-JP | ???猶??揄??敖 | 00111111001111110011111111001101101100010011111100111111110110011110100100111111001111111101101011000100 | 3f3f3fcdb13f3fd9e93f3fdac4 |
| UTF-8 | 緣쏅쪈猶뉓염揄얜뤆敖 | 111001111011011110100011111011001000111110000101111011001010101010001000111001111000110010110110111010111000100110010011111011001001011110111100111001101000111110000100111011001001011010011100111010111010010010000110111001101001010110010110 | e7b7a3ec8f85ecaa88e78cb6eb8993ec97bce68f84ec969ceba486e69596 |
| UHC | 緣쏅쪈猶뉓염揄얜뤆敖 | 1110011011011110100110111110101110100101100000101110101110100010100001111110100010111111101100001110101011110001101111101110101110001111101101101110011111111001 | e6de9beba582eba287e8bfb0eaf1beeb8fb6e7f9 |

SJIS-WIN, EUC-JP: legacy charsets used mainly to encode Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP) respectively.
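
The same kind of conversion can be sketched in a few lines of Python. This is a rough illustration, not the code behind this page. The codec names are assumptions: `cp932` stands in for SJIS-WIN and `cp949` for UHC. The helper names `encode_row` and `convert` are hypothetical. Characters that a charset cannot represent are replaced with "?" via `errors="replace"`, which is assumed to correspond to the `3f` bytes in the results.

```python
# Assumed mapping from the charset labels used on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 is the closest Python codec
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 is the closest Python codec
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # Unencodable characters become b"?" (0x3f) instead of raising an error.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset and collect the results by label."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in convert("猶").items():
        print(label, bits, hexstr)
```

For the single character 猶, this sketch yields `9750` for SJIS-WIN, `cdb1` for EUC-JP, and `e78cb6` for UTF-8, the same byte sequences that appear for that character in the results above, while ISO-8859-1 falls back to `3f`.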