To what bit string is a character (or string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 臍??貊曼?基??臍???????傲? 11100100011000000011111100111111111001101011101110011001110101100011111110001010111011100011111100111111111001000110000000111111001111110011111100111111001111110011111100111111100110001111110000111111 e4603f3fe6bb99d63f8aee3f3fe4603f3f3f3f3f3f3f98fc3f
EUC-JP 臍??貊曼嫄基??臍????璲??傲? 1110011111000001001111110011111111101100101111011101001011011000100011111011101010100001101101001111000000111111001111111110011111000001001111110011111100111111001111111000111111001100111001010011111100111111110100001111111000111111 e7c13f3fecbdd2d88fbaa1b4f03f3fe7c13f3f3f3f8fcce53f3fd0fe3f
UTF-8 臍樓렰貊曼嫄基렰렔臍陋렩렭뤳璲咽겨傲츓 111010001000011110001101111011111010010110001100111010111010000010110000111010001011001010001010111001101001101110111100111001011010101110000100111001011001111110111010111010111010000010110000111010111010000010010100111010001000011110001101111011111010010110010001111010111010000010101001111010111010000010101101111010111010010010110011111001111001001010110010111011111010011010011110111010101011001010101000111001011000001010110010111011001011100010010011 e8878defa58ceba0b0e8b28ae69bbce5ab84e59fbaeba0b0eba094e8878defa591eba0a9eba0adeba4b3e792b2efa69eeab2a8e582b2ecb893
UHC 臍樓렰貊曼嫄基렰렔臍陋렩렭뤳璲咽겨傲츓 1111000010110000110100101110011010001110101111011101100011100111110110001011101011101010101100011101000011110001100011101011110110001110101010011111000010110000110100101110101110001110101101111000111010111010100011111110000111100010101100001110011011101100101100001101110011100111111011001010111010001110 f0b0d2e68ebdd8e7d8baeab1d0f18ebd8ea9f0b0d2eb8eb78eba8fe1e2b0e6ecb0dce7ecae8e

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
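
The conversion this page performs can be approximated offline. The following is a minimal Python sketch, not the page's actual implementation. The function name `encode_table` and the mapping from the table's charset labels to Python codec names are assumptions (in particular, `cp932` stands in for SJIS-Win and `cp949` for UHC). Python's `euc_jp` codec may also differ from the page's EUC-JP in some details. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the ISO-8859-1 row above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 is used for SJIS-Win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 is used for UHC
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("あ"):
        print(label, bits, hex_string)
```

Each byte is written as eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal column.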