To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???爾g?飮??鶯 0011111100111111001111111000111010100010100000101000011100111111100111110101101000111111001111111110100111110010 3f3f3f8ea282873f9f5a3f3fe9f2
EUC-JP ???爾g?飮??鶯 0011111100111111001111111011110010100100101000111110011100111111110111011011101100111111001111111111001011110100 3f3f3fbca4a3e73fddbb3f3ff2f4
UTF-8 咽뉖뀘爾g뜏飮껋떼鶯 111011111010011010011110111010111000100110010110111010111000000010011000111001111000100010111110111011111011110110000111111010111001110010001111111010011010001110101110111010101011101110001011111010111001011010111100111010011011011010101111 efa69eeb8996eb8098e788beefbd87eb9c8fe9a3aeeabb8beb96bce9b6af
UHC 咽뉖뀘爾g뜏飮껋떼鶯 1110011011101100100001111110101110000101100100011110110010110011101000111110011110001101100100101110101111100110100000111110110010110110101111001110010110100011 e6ec87eb8591ecb3a3e78d92ebe683ecb6bce5a3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)