What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset  Character  Bit string (binary)  Bit string (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 悅??弱??喩??? 11111010101111010011111100111111100011101110001100111111001111111001101001100111001111110011111100111111 fabd3f3f8ee33f3f9a673f3f3f
EUC-JP ???弱??喩??? 001111110011111100111111101111001110010100111111001111111101001111001000001111110011111100111111 3f3f3fbce53f3fd3c83f3f3f
UTF-8 悅쎈젷弱뉐굄喩드쉑藺 111001101000001010000101111011001000111010001000111011001010000010110111111001011011110010110001111010111000100110010000111010101011010110000100111001011001011010101001111010111001001110011100111011001000100110010001111011111010011110110000 e68285ec8e88eca0b7e5bcb1eb8990eab584e596a9eb939cec8991efa7b0
UHC 悅쎈젷弱뉐굄喩드쉑藺 1110011011101101101111011110101110100000101010111110010110110000100001111110010110110001101011111110101011100111101101011110010110111101101001111110110011100001 e6edbdeba0abe5b087e5b1afeae7b5e5bda7ece1

SJIS-Win, EUC-JP: Legacy character sets used mainly for encoding Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
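
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the page's actual implementation. The codec names (`cp932`, `euc_jp`, `cp949`, and so on) are Python's own, and mapping them to the labels above is an assumption. The helper names `encode_row` and `table` are hypothetical. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which resembles the `3f` runs in the table, although the page's exact substitution rules may differ.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # the page states SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # Unified Hangul Code, Windows code page 949
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' (0x3f) via errors="replace".
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


def table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in table("A").items():
        print(label, bits, hexstr)
```

For example, `encode_row("あ", "cp932")` returns the bit string `1000001010100000` and the hex string `82a0`, while `encode_row("あ", "iso-8859-1")` returns `00111111` and `3f` because ISO-8859-1 has no hiragana.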