To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 節??辱??嗚??節ユ?節??辱?????埇 100100001101111100111111001111111001000001001010001111110011111110011010011010100011111100111111100100001101111110000011100001100011111110010000110111110011111100111111100100000100101000111111001111110011111100111111001111111111101010011010 90df3f3f904a3f3f9a6a3f3f90df83863f90df3f3f904a3f3f3f3f3ffa9a
EUC-JP 節??辱??嗚??節ユ?節??辱?????埇 11000000111000010011111100111111101111111010101100111111001111111101001111001011001111110011111111000000111000011010010111100110001111111100000011100001001111110011111110111111101010110011111100111111001111110011111100111111100011111011011111100111 c0e13f3fbfab3f3fd3cb3f3fc0e1a5e63fc0e13f3fbfab3f3f3f3f3f8fb7e7
UTF-8 節억쉭辱녻뮅嗚뷂슭節ユ콖節억쉭辱녻뮅若잍꼇埇 111001111010111110000000111011001001011010110101111011001000100110101101111010001011111010110001111010111000010110111011111010111010111010000101111001011001011110011010111010111011011110000010111011001000101010101101111001111010111110000000111000111000001110100110111011001011110110010110111001111010111110000000111011001001011010110101111011001000100110101101111010001011111010110001111010111000010110111011111010111010111010000101111011111010010110110100111011001001111010001101111010101011110010000111111001011001111110000111 e7af80ec96b5ec89ade8beb1eb85bbebae85e5979aebb782ec8aade7af80e383a6ecbd96e7af80ec96b5ec89ade8beb1eb85bbebae85efa5b4ec9e8deabc87e59f87
UHC 節억쉭辱녻뮅嗚뷂슭節ユ콖節억쉭辱녻뮅若잍꼇埇 1110111110111101101111101110111110111101101011011110100110110100100001101110100010010010100101001110011111110000100101001110111110111101101111101110111110111101101010111110011010110001100100001110111110111101101111101110111110111101101011011110100110110100100001101110100010010010100101001110010110101110100111111110011010110010101110111110100110111001 efbdbeefbdade9b486e89294e7f094efbdbeefbdabe6b190efbdbeefbdade9b486e89294e5ae9fe6b2bbe9b9

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)