To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 俉??汚??梧??節ワ????節??鈺?< 1111101001100001001111110011111110001001100110000011111100111111100011001110011000111111001111111001000011011111100000111000111100111111001111110011111100111111100100001101111100111111001111111111101111000100001111111000000110000011 fa613f3f89983f3f8ce63f3f90df838f3f3f3f3f90df3f3ffbc43f8183
EUC-JP 俉??汚??梧??節ワ????節??鈺?< 10001111101100011011101100111111001111111011000111111000001111110011111110111000111010000011111100111111110000001110000110100101111011110011111100111111001111110011111111000000111000010011111100111111100011111110001111010101001111111010000111100011 8fb1bb3f3fb1f83f3fb8e83f3fc0e1a5ef3f3f3f3fc0e13f3f8fe3d53fa1e3
UTF-8 俉놂슝汚억슬梧삥쮵節ワ쉠樂띺긽節억슴鈺썲< 111001001011111110001001111010111000011010000010111011001000101010011101111001101011000110011010111011001001011010110101111011001000101010101100111001101010001010100111111011001000001010100101111011001010111010110101111001111010111110000000111000111000001110101111111011001000100110100000111011111010011010111111111010111001110110111010111010101011100010111101111001111010111110000000111011001001011010110101111011001000101010110100111010011000100010111010111011001000110110110010111011111011110010011100 e4bf89eb8682ec8a9de6b19aec96b5ec8aace6a2a7ec82a5ecaeb5e7af80e383afec89a0efa6bfeb9dbaeab8bde7af80ec96b5ec8ab4e988baec8db2efbc9c
UHC 俉놂슝汚억슬梧삥쮵節ワ쉠樂띺긽節억슴鈺썲< 111001111110101110110011111011111011110110111001111001111111110110111110111011111011110110111101111001111111110010111011111001101010100010010010111011111011110110101011111011111011110110101010111010001111100110001101111010011000001110000001111011111011110110111110111011111011110110111111111010001010110110111101111001011010001110111100 e7ebb3efbdb9e7fdbeefbdbde7fcbbe6a892efbdabefbdaae8f98de98381efbdbeefbdbfe8adbde5a3bc

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)