To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 俉??節??燿??徇?俉??節??燿??徇?B 11111010011000010011111100111111100100001101111100111111001111111110000010100000001111110011111110011100011011010011111111111010011000010011111100111111100100001101111100111111001111111110000010100000001111110011111110011100011011010011111101000010 fa613f3f90df3f3fe0a03f3f9c6d3ffa613f3f90df3f3fe0a03f3f9c6d3f42
EUC-JP 俉??節??燿??徇?俉??節??燿??徇?B 100011111011000110111011001111110011111111000000111000010011111100111111111000001010001000111111001111111101011111001110001111111000111110110001101110110011111100111111110000001110000100111111001111111110000010100010001111110011111111010111110011100011111101000010 8fb1bb3f3fc0e13f3fe0a23f3fd7ce3f8fb1bb3f3fc0e13f3fe0a23f3fd7ce3f42
UTF-8 俉녘뙣節쏙슴燿녕쥈徇똺俉녘뙣節쏙슴燿녕쥈徇똺B 11100100101111111000100111101011100001011001100011101011100110011010001111100111101011111000000011101100100011111001100111101100100010101011010011100111100001111011111111101011100001011001010111101100101001011000100011100101101111101000011111101011100110001011101011100100101111111000100111101011100001011001100011101011100110011010001111100111101011111000000011101100100011111001100111101100100010101011010011100111100001111011111111101011100001011001010111101100101001011000100011100101101111101000011111101011100110001011101001000010 e4bf89eb8598eb99a3e7af80ec8f99ec8ab4e787bfeb8595eca588e5be87eb98bae4bf89eb8598eb99a3e7af80ec8f99ec8ab4e787bfeb8595eca588e5be87eb98ba42
UHC 俉녘뙣節쏙슴燿녕쥈徇똺俉녘뙣節쏙슴燿녕쥈徇똺B 111001111110101110110011111010001000110010101000111011111011110110111101111011111011110110111111111010001111110010110011111001111010001010000001111000101101111110001100011110101110011111101011101100111110100010001100101010001110111110111101101111011110111110111101101111111110100011111100101100111110011110100010100000011110001011011111100011000111101001000010 e7ebb3e88ca8efbdbdefbdbfe8fcb3e7a281e2df8c7ae7ebb3e88ca8efbdbdefbdbfe8fcb3e7a281e2df8c7a42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)