To what bit string is a character (or short string) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ???????? | 0011111100111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f3f |
| SJIS-WIN | 丹遜他竪促奪奪叩 | 10010010010011111001000110111011100100011011110010010010010001111001000110100011100100100100010010010010010001001001001001000000 | 924f91bb91bc924791a3924492449240 |
| EUC-JP | 丹遜他竪促奪奪叩 | 11000011101100001100001010111101110000101011111011000011101010001100001010100101110000111010010111000011101001011100001110100001 | c3b0c2bdc2bec3a8c2a5c3a5c3a5c3a1 |
| UTF-8 | 丹遜他竪促奪奪叩 | 111001001011100010111001111010011000000110011100111001001011101110010110111001111010101110101010111001001011111110000011111001011010010110101010111001011010010110101010111001011000111110101001 | e4b8b9e9819ce4bb96e7abaae4bf83e5a5aae5a5aae58fa9 |
| UHC | 丹遜他竪促奪奪叩 | 11010011101000011110000111100001111101101110001011100010101101011111010110110101111101111010110011110111101011001100110110110000 | d3a1e1e1f6e2e2b5f5b5f7acf7accdb0 |

SJIS-WIN, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
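
The conversion shown in the table can be approximated offline with a short Python sketch. It is not the code behind this page. The mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and `CODECS` and `encode_bits` are hypothetical names introduced for illustration. Unmappable characters are replaced with `?` (0x3f), which would explain the ISO-8859-1 row.

```python
# Assumed correspondence between the table's labels and Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_bits(text, codec):
    """Return (binary string, hex string) for `text` encoded with `codec`."""
    # errors="replace" substitutes "?" (0x3f) for characters the charset lacks.
    raw = text.encode(codec, errors="replace")
    # Render each byte as eight binary digits, with no separators.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


if __name__ == "__main__":
    for label, codec in CODECS.items():
        bits, hexed = encode_bits("丹", codec)
        print(label, bits, hexed)
```

Calling `encode_bits` with a longer string simply concatenates the per-character byte sequences, which is how the rows in the table above are laid out.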