To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???柔??揄???▲????恂レ??κ? 001111110011111100111111100011110101111100111111001111111001110110001001001111110011111100111111100000011010001100111111001111110011111100111111100111001001011010000011100011000011111100111111100000111100100000111111 3f3f3f8f5f3f3f9d893f3f3f81a33f3f3f3f9c96838c3f3f83c83f
EUC-JP ???柔??揄??璵▲????恂レ??κ? 0011111100111111001111111011110111000000001111110011111111011001111010010011111100111111100011111100110011100110101000101010010100111111001111110011111100111111110101111111011010100101111011000011111100111111101001101100101000111111 3f3f3fbdc03f3fd9e93f3f8fcce6a2a53f3f3f3fd7f6a5ec3f3fa6ca3f
UTF-8 輦깅벉柔꾤몭揄멤뵱璵▲룂泥롳쫳恂レ뵦若κ렌 1110111110100110100110001110101010111001100001011110101110110010100010011110011010011111100101001110101010111110101001001110101110101010101011011110011010001111100001001110101110101001101001001110101110110101101100011110011110010010101101011110001010010110101100101110101110100011100000101110111110100111101000111110101110100001101100111110110010101011101100111110011010000001100000101110001110000011101011001110101110110101101001101110111110100101101101001100111010111010111010111010000010001100 efa698eab985ebb289e69f94eabea4ebaaade68f84eba9a4ebb5b1e792b5e296b2eba382efa7a3eba1b3ecabb3e68182e383acebb5a6efa5b4cebaeba08c
UHC 輦깅벉柔꾤몭揄멤뵱璵▲룂泥롳쫳恂レ뵦若κ렌 111001101110010010110001111010111001001110101100111010101111010110000100111001111001000110010111111010101111000110111000111000101001010010101111111001101010010110100001111000111000111110000011111011001011001010001110111011111010011010001011111000101110000110101011111011001001010010100101111001011010111010100101111010101011011110111011 e6e4b1eb93aceaf584e79197eaf1b8e294afe6a5a1e38f83ecb28eefa68be2e1abec94a5e5aea5eab7bb

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)