To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ?????????? | 00111111001111110011111100111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN | ???甸ョお???? | 00111111001111110011111110011001101100101000001110000111100000101010100000111111001111110011111100111111 | 3f3f3f99b2838782a83f3f3f3f
EUC-JP | ???甸ョお???? | 00111111001111110011111111010010101101001010010111100111101001001010101000111111001111110011111100111111 | 3f3f3fd2b4a5e7a4aa3f3f3f3f
UTF-8 | 黎곹쓷甸ョお溜쒐츕溜 | 111011111010011010001001111010101011001110111001111011001001001110110111111001111001010010111000111000111000001110100111111000111000000110001010111011111010011110001011111011001001001010010000111011001011100010010101111011111010011110001011 | efa689eab3b9ec93b7e794b8e383a7e3818aefa78bec9290ecb895efa78b
UHC | 黎곹쓷甸ョお溜쒐츕溜 | 1110011010110001100000011110110110011101100101001110111110100100101010111110011110101010101010101110101011111110100111001110011110101110100011111110101011111110 | e6b181ed9d94efa4abe7aaaaeafe9ce7ae8feafe

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
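
A conversion like the one in the table can be reproduced with a short Python sketch. This is not the converter's own code. The function name `to_bit_strings` and the `CHARSETS` mapping are hypothetical, and the mapping from the tool's charset labels to Python codec names (SJIS-WIN to `cp932`, UHC to `cp949`) is an assumption. Passing `errors="replace"` is also an assumption, chosen because it substitutes `?` (0x3f) for characters the target charset cannot represent, which resembles the `3f` bytes in the table.

```python
# Assumed mapping from the tool's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 stands in for SJIS-WIN
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 stands in for UHC
}


def to_bit_strings(text: str, codec: str) -> tuple[str, str]:
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # Characters the codec cannot represent are replaced by "?" (byte 0x3f).
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


if __name__ == "__main__":
    sample = "お"
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, sample, binary, hexadecimal)
```

Each byte is rendered as exactly eight bits, so the binary string is always eight times as long as the byte count and four times as long as the hexadecimal string.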