To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????G 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f47
SJIS-WIN 霎ソ蛛エ雜ウ譽夂カ壽純隹キ蟄ォ蜊ウG 11101000101111101011111111100101100000011011010011101000101101101011001111100110101000111001101011100111101101101001101011100110100011111000001111101000101100001011011111100101101011011010101111100101100011011011001101000111 e8bebfe581b4e8b6b3e6a39ae7b69ae68f83e8b0b7e5adabe58db347
EUC-JP 霎ソ蛛エ雜ウ譽夂カ壽純隹キ蟄ォ蜊ウG 1111000011000000100011101011111111101001111000011000111010110100111100001011100010001110101100111110110010100101110101001110100110001110101101101101010011101000101111011110001111110000101100101000111010110111111010101010111110001110101010111110100111101101100011101011001101000111 f0c08ebfe9e18eb4f0b88eb3eca5d4e98eb6d4e8bde3f0b28eb7eaaf8eabe9ed8eb347
UTF-8 霎ソ蛛エ雜ウ譽夂カ壽純隹キ蟄ォ蜊ウG 11101001100111001000111011101111101111011011111111101000100110111001101111101111101111011011010011101001100110111001110011101111101111011011001111101000101011011011110111100101101001001000001011101111101111011011011011100101101000111011110111100111101101001001010011101001100110101011100111101111101111011011011111101000100111111000010011101111101111011010101111101000100111001000101011101111101111011011001101000111 e99c8eefbdbfe89b9befbdb4e99b9cefbdb3e8adbde5a482efbdb6e5a3bde7b494e99ab9efbdb7e89f84efbdabe89c8aefbdb347
UHC ??蛛?雜?譽??壽純??蟄???G 001111110011111111110001110010000011111111101101110110100011111111100111111000100011111100111111111000011111100011100010111011010011111100111111111101101101111000111111001111110011111101000111 3f3ff1c83fedda3fe7e23f3fe1f8e2ed3f3ff6de3f3f3f47

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)