To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??宜??濡??語η?筍レ?筌??裕 11100001100111110011111100111111100010110101100000111111001111111001010001000111001111110011111110001100111010101000001111000101001111111110001010100001100000111000110000111111111000101010001100111111001111111001011101010100 e19f3f3f8b583f3f94473f3f8cea83c53fe2a1838c3fe2a33f3f9754
EUC-JP 癲??宜??濡??語η?筍レ?筌??裕 11100010101000010011111100111111101101011011100100111111001111111100011110101000001111110011111110111000111011001010011011000111001111111110010010100011101001011110110000111111111001001010010100111111001111111100110110110101 e2a13f3fb5b93f3fc7a83f3fb8eca6c73fe4a3a5ec3fe4a53f3fcdb5
UTF-8 癲덈챶宜룝슭濡⑸눀語η몭筍レ쑂筌뤾쑬裕 1110011110011001101100101110101110001101100010001110110010110001101101101110010110101110100111001110101110100011100111011110110010001010101011011110011010111111101000011110001010010001101110001110101110001000100000001110100010101010100111101100111010110111111010111010101010101101111001111010110110001101111000111000001110101100111011001001000110000010111001111010110110001100111010111010010010111110111011001001000110101100111010001010001110010101 e799b2eb8d88ecb1b6e5ae9ceba39dec8aade6bfa1e291b8eb8880e8aa9eceb7ebaaade7ad8de383acec9182e7ad8ceba4beec91ace8a395
UHC 癲덈챶宜룝슭濡⑸눀語η몭筍レ쑂筌뤾쑬裕 1110111110100110100010001110101110101010100000111110101111110001101101111110010010111101101111101110101110100001101010011110101110000111101000011110010111011110101001011110011110010001100101111110001011101100101010111110110010011100101000101110111110100111100011111110101010111110101010001110101110101110 efa688ebaa83ebf1b7e4bdbeeba1a9eb87a1e5dea5e79197e2ecabec9ca2efa78feabea8ebae

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)