What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 蜈??夜??鸚????????蜈??節?? 1110010110000101001111110011111110010110111010010011111100111111111010100101111100111111001111110011111100111111001111110011111100111111001111111110010110000101001111110011111110010000110111110011111100111111 e5853f3f96e93f3fea5f3f3f3f3f3f3f3f3fe5853f3f90df3f3f
EUC-JP 蜈??夜??鸚?????旿??蜈??節?? 11101001111001010011111100111111110011001110101100111111001111111111001111000000001111110011111100111111001111110011111110001111110000011111010000111111001111111110100111100101001111110011111111000000111000010011111100111111 e9e53f3fcceb3f3ff3c03f3f3f3f3f8fc1f43f3fe9e53f3fc0e13f3f
UTF-8 蜈졾ㄷ夜쇠쪧鸚㏆쉼樂곭쐴旿울쉘蜈졾ㄷ節경ː 1110100010011100100010001110110010100001101111101110001110000100101101111110010110100100100111001110110010000111101000001110110010101010101001111110100110111000100110101110001110001111100001101110110010001001101111001110111110100110101111111110101010110011101011011110110010010000101101001110011010010111101111111110110010011010101110001110110010001001100110001110100010011100100010001110110010100001101111101110001110000100101101111110011110101111100000001110101010110010101111011100101110010000 e89c88eca1bee384b7e5a49cec87a0ecaaa7e9b89ae38f86ec89bcefa6bfeab3adec90b4e697bfec9ab8ec8998e89c88eca1bee384b7e7af80eab2bdcb90
UHC 蜈졾ㄷ夜쇠쪧鸚㏆쉼樂곭쐴旿울쉘蜈졾ㄷ節경ː 111010001010010110100000111001011010010010100111111001011010100010111100111010001010010110100000111001011010010010100111111011111011110110110000111010001111100110000001111001111011111010100001111001111111101010111111111011111011110110101001111010001010010110100000111001011010010010100111111011111011110110110000111001101010001010110000 e8a5a0e5a4a7e5a8bce8a5a0e5a4a7efbdb0e8f981e7bea1e7fabfefbda9e8a5a0e5a4a7efbdb0e6a2b0

SJIS-win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-win = CP932) and UNIX (EUC-JP).
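
The conversion shown in the table can be approximated offline with a minimal Python sketch. It is not the code behind this page. The function name `encode_table` is hypothetical, and mapping SJIS-win to Python's `cp932` codec and UHC to `cp949` is an assumption. Replacing characters that a charset cannot represent with "?" (byte 3f) is also an assumption, chosen because 3f is what appears in the rows above.

```python
# Hypothetical helper: encode one string in several charsets and
# return the resulting bytes as binary and hexadecimal bit strings.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-win": "cp932",   # assumed equivalent of SJIS-win (CP932)
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumed equivalent of UHC
}


def encode_table(text):
    """Return a list of (charset, binary string, hex string) rows."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexstr in encode_table("あ"):
        print(label, bits, hexstr)
```

Each byte is rendered as exactly eight binary digits and two hexadecimal digits, so the two columns always describe the same byte sequence.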