To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ?暖??睛???宏?暖??睛???槐^ 00111111100100100110011100111111001111111110000111001011001111110011111100111111100011010100011100111111100100100110011100111111001111111110000111001011001111110011111100111111100111101100010101011110 3f92673f3fe1cb3f3f3f8d473f92673f3fe1cb3f3f3f9ec55e
EUC-JP ?暖磻?睛?磻?宏?暖磻?睛?磻?槐^ 001111111100001111001000100011111101000010111010001111111110001011001101001111111000111111010000101110100011111110111001101010000011111111000011110010001000111111010000101110100011111111100010110011010011111110001111110100001011101000111111110111001100011101011110 3fc3c88fd0ba3fe2cd3f8fd0ba3fb9a83fc3c88fd0ba3fe2cd3f8fd0ba3fdcc75e
UTF-8 뤶暖磻녹睛핉磻노宏뤶暖磻녹睛핉磻노槐^ 11101011101001001011011011100110100110101001011011100111101000111011101111101011100001011011100111100111100111011001101111101101100101011000100111100111101000111011101111101011100001011011100011100101101011101000111111101011101001001011011011100110100110101001011011100111101000111011101111101011100001011011100111100111100111011001101111101101100101011000100111100111101000111011101111101011100001011011100011100110101001111001000001011110 eba4b6e69a96e7a3bbeb85b9e79d9bed9589e7a3bbeb85b8e5ae8feba4b6e69a96e7a3bbeb85b9e79d9bed9589e7a3bbeb85b8e6a7905e
UHC 뤶暖磻녹睛핉磻노宏뤶暖磻녹睛핉磻노槐^ 10001111111001001101000111101100110110101111001010110011111011001110111111101100110000001000111011011010111100101011001111101011110011101101101110001111111001001101000111101100110110101111001010110011111011001110111111101100110000001000111011011010111100101011001111101011110011101101100101011110 8fe4d1ecdaf2b3ecefecc08edaf2b3ebcedb8fe4d1ecdaf2b3ecefecc08edaf2b3ebced95e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)