To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 鉅ャ遒≫燬蠑迎鉅ャ遒≫燬蠑鶏^ 111001111110100010101100111001111010001010000001111000101110000010011011111001011011110010001100011111011110011111101000101011001110011110100010100000011110001011100000100110111110010110111100100011000111101101011110 e7e8ace7a281e2e09be5bc8c7de7e8ace7a281e2e09be5bc8c7b5e
EUC-JP 鉅ャ遒≫燬蠑迎鉅ャ遒≫燬蠑鶏^ 1110111011101010100011101010110011101110101001001010001011100100110111111111101111101010101111101011011111011110111011101110101010001110101011001110111010100100101000101110010011011111111110111110101010111110101101111101110001011110 eeea8eaceea4a2e4dffbeabeb7deeeea8eaceea4a2e4dffbeabeb7dc5e
UTF-8 鉅ャ遒≫燬蠑迎鉅ャ遒≫燬蠑鶏^ 11101001100010011000010111101111101111011010110011101001100000011001001011100010100010011010101111100111100001111010110011101000101000001001000111101000101111111000111011101001100010011000010111101111101111011010110011101001100000011001001011100010100010011010101111100111100001111010110011101000101000001001000111101001101101101000111101011110 e98985efbdace98192e289abe787ace8a091e8bf8ee98985efbdace98192e289abe787ace8a091e9b68f5e
UHC 鉅??≫??迎鉅??≫???^ 1100101111101001001111110011111110100001111011010011111100111111111001111100101011001011111010010011111100111111101000011110110100111111001111110011111101011110 cbe93f3fa1ed3f3fe7cacbe93f3fa1ed3f3f3f5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)