To which bit string is a character (or a short string) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 弔?怨封?耿??垣? 100100101010001000111111100010011000010110010101100101010011111111100011110101000011111100111111100010100101111100111111 92a23f898595953fe3d43f3f8a5f3f
EUC-JP 弔?怨封?耿??垣? 110001001010010000111111101100011110010111001001111101010011111111100110110101100011111100111111101100111100000000111111 c4a43fb1e5c9f53fe6d63f3fb3c03f
UTF-8 弔렲怨封렮耿렱렲垣렖 111001011011110010010100111010111010000010110010111001101000000010101000111001011011000010000001111010111010000010101110111010001000000010111111111010111010000010110001111010111010000010110010111001011001111010100011111010111010000010010110 e5bc94eba0b2e680a8e5b081eba0aee880bfeba0b1eba0b2e59ea3eba096
UHC 弔렲怨封렮耿렱렲垣렖 1111000011000000100011101011111111101010101100111101110011100110100011101011101111001100111010101000111010111110100011101011111111101010101011111000111010101011 f0c08ebfeab3dce68ebbccea8ebe8ebfeaaf8eab

SJIS-Win, EUC-JP: Classic charsets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). Characters that a charset cannot represent appear as "?" (0x3f) in the table.
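
The conversion in the table can be approximated offline with a minimal Python sketch. It is not the code behind this page. The function name `to_bits` and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) are assumptions made for illustration. Unencodable characters are replaced with "?" (0x3f), which matches the ISO-8859-1 row above.

```python
# Table labels mapped to Python codec names (this mapping is an assumption).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for characters that the
    # target charset cannot represent.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


if __name__ == "__main__":
    # Print one line per charset for a single sample character.
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bits("弔", codec)
        print(label, binary, hexadecimal)
```

Each byte is formatted as eight binary digits, so the binary column is always four times as long as the hexadecimal column.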