To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 堊??悟??岳??[堊??悟??岳??[^ 100110101011111100111111001111111000110011100101001111110011111110001010011110000011111100111111010110111001101010111111001111110011111110001100111001010011111100111111100010100111100000111111001111110101101101011110 9abf3f3f8ce53f3f8a783f3f5b9abf3f3f8ce53f3f8a783f3f5b5e
EUC-JP 堊??悟??岳??[堊??悟??岳??[^ 110101001100000100111111001111111011100011100111001111110011111110110011110110010011111100111111010110111101010011000001001111110011111110111000111001110011111100111111101100111101100100111111001111110101101101011110 d4c13f3fb8e73f3fb3d93f3f5bd4c13f3fb8e73f3fb3d93f3f5b5e
UTF-8 堊뜸퓙悟녵쓺岳꾥쁿[堊뜸퓙悟녵쓺岳꾥쁿[^ 111001011010000010001010111010111001110010111000111011011001001110011001111001101000001010011111111010111000010110110101111011001001001110111010111001011011001010110011111010101011111010100101111011001000000110111111010110111110010110100000100010101110101110011100101110001110110110010011100110011110011010000010100111111110101110000101101101011110110010010011101110101110010110110010101100111110101010111110101001011110110010000001101111110101101101011110 e5a08aeb9cb8ed9399e6829feb85b5ec93bae5b2b3eabea5ec81bf5be5a08aeb9cb8ed9399e6829feb85b5ec93bae5b2b3eabea5ec81bf5b5e
UHC 堊뜸퓙悟녵쓺岳꾥쁿[堊뜸퓙悟녵쓺岳꾥쁿[^ 111001001011111010110110111001001011111110000100111001111111011010000110111001001011111010110110111001001011111110000100111010001001100010000110010110111110010010111110101101101110010010111111100001001110011111110110100001101110010010111110101101101110010010111111100001001110100010011000100001100101101101011110 e4beb6e4bf84e7f686e4beb6e4bf84e898865be4beb6e4bf84e7f686e4beb6e4bf84e898865b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)