To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 霎ソ蛛エ雜ウ譽夂カ壼ュ倡矯蟄ォ邯壼キス雜ウ 111010001011111010111111111001011000000110110100111010001011011010110011111001101010001110011010111001111011011010011010111001011010110110011000111001111000101110111000111001011010110110101011111001111011011010011010111001011011011110111101111010001011011010110011 e8bebfe581b4e8b6b3e6a39ae7b69ae5ad98e78bb8e5adabe7b69ae5b7bde8b6b3
EUC-JP 霎ソ蛛エ雜ウ譽夂カ壼ュ倡矯蟄ォ邯壼キス雜ウ 111100001100000010001110101111111110100111100001100011101011010011110000101110001000111010110011111011001010010111010100111010011000111010110110110101001110011110001110101011011101000011101001101101101011101011101010101011111000111010101011111011101011100011010100111001111000111010110111100011101011110111110000101110001000111010110011 f0c08ebfe9e18eb4f0b88eb3eca5d4e98eb6d4e78eadd0e9b6baeaaf8eabeeb8d4e78eb78ebdf0b88eb3
UTF-8 霎ソ蛛エ雜ウ譽夂カ壼ュ倡矯蟄ォ邯壼キス雜ウ 111010011001110010001110111011111011110110111111111010001001101110011011111011111011110110110100111010011001101110011100111011111011110110110011111010001010110110111101111001011010010010000010111011111011110110110110111001011010001110111100111011111011110110101101111001011000000010100001111001111001111110101111111010001001111110000100111011111011110110101011111010011000001010101111111001011010001110111100111011111011110110110111111011111011110110111101111010011001101110011100111011111011110110110011 e99c8eefbdbfe89b9befbdb4e99b9cefbdb3e8adbde5a482efbdb6e5a3bcefbdade580a1e79fafe89f84efbdabe982afe5a3bcefbdb7efbdbde99b9cefbdb3
UHC ??蛛?雜?譽????倡矯蟄?邯???雜? 0011111100111111111100011100100000111111111011011101101000111111111001111110001000111111001111110011111100111111111100111101101111001110111011001111011011011110001111111100101011111011001111110011111100111111111011011101101000111111 3f3ff1c83fedda3fe7e23f3f3f3ff3dbceecf6de3fcafb3f3f3fedda3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)