To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????宥??亦??踰?┤幽??鈺?? 001111110011111100111111001111110011111100111111100101110100011100111111001111111001011010010010001111110011111111100110111110100011111110000100101001111001011101001000001111110011111111111011110001000011111100111111 3f3f3f3f3f3f97473f3f96923f3fe6fa3f84a797483f3ffbc43f3f
EUC-JP ??????宥??亦??踰?┤幽??鈺?? 00111111001111110011111100111111001111110011111111001101101010000011111100111111110010111111001000111111001111111110110011111100001111111010100010101001110011011010100100111111001111111000111111100011110101010011111100111111 3f3f3f3f3f3fcda83f3fcbf23f3fecfc3fa8a9cda93f3f8fe3d53f3f
UTF-8 閱묐챶杻⒳틠宥몄벑亦밸갭踰곤┤幽덀뀏鈺곌퓖 111010011001011010110001111010111010110010010000111011001011000110110110111011111010011110001000111000101001001010110011111011011000101110100000111001011010111010100101111010111010101010000100111010111011001010010001111001001011101010100110111010111011000010111000111010101011000010101101111010001011100010110000111010101011001110100100111000101001010010100100111001011011100110111101111010111000110110000000111010111000000010001111111010011000100010111010111010101011001110001100111011011001001110010110 e996b1ebac90ecb1b6efa788e292b3ed8ba0e5aea5ebaa84ebb291e4baa6ebb0b8eab0ade8b8b0eab3a4e294a4e5b9bdeb8d80eb808fe988baeab38ced9396
UHC 閱묐챶杻⒳틠宥몄벑亦밸갭踰곤┤幽덀뀏鈺곌퓖 111001101111001110010001111010111010101010000011111010101111010010101001111001001011101010001100111010101110100110111000111011001001001110110001111001101011001010111001111010111011000010111000111010111011001010110000111011111010011010101001111010101110101110001000111000111000010110001010111010001010110110110000111010101011111110000001 e6f391ebaa83eaf4a9e4ba8ceae9b8ec93b1e6b2b9ebb0b8ebb2b0efa6a9eaeb88e3858ae8adb0eabf81

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)