What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 鼇??瓦??由?┨??????畑?┰???^ 11101010100001110011111100111111100010101010001000111111001111111001011101010010001111111000010010110111001111110011111100111111001111110011111100111111100101001010100000111111100001001011101100111111001111110011111101011110 ea873f3f8aa23f3f97523f84b73f3f3f3f3f3f94a83f84bb3f3f3f5e
EUC-JP 鼇??瓦??由?┨濚?????畑?┰???^ 111100111110011100111111001111111011010010100100001111110011111111001101101100110011111110101000101110011000111111001001101000010011111100111111001111110011111100111111110010001010101000111111101010001011110100111111001111110011111101011110 f3e73f3fb4a43f3fcdb33fa8b98fc9a13f3f3f3f3fc8aa3fa8bd3f3f3f5e
UTF-8 鼇귣젛瓦귨쪖由썹┨濚껇쮥溜쀨퓗畑먮┰獵방짎^ 11101001101111001000011111101010101101111010001111101100101000001001101111100111100100111010011011101010101101111010100011101100101010101001011011100111100101001011000111101100100011011011100111100010100101001010100011100110101111111001101011101010101110111000011111101100101011101010010111101111101001111000101111101100100000001010100011101101100100111001011111100111100101011001000111101011101010001010111011100010100101001011000011101111101001101010011111101011101100001010100111101100101001111000111001011110 e9bc87eab7a3eca09be793a6eab7a8ecaa96e794b1ec8db9e294a8e6bf9aeabb87ecaea5efa78bec80a8ed9397e79591eba8aee294b0efa6a7ebb0a9eca78e5e
UHC 鼇귣젛瓦귨쪖由썹┨濚껇쮥溜쀨퓗畑먮┰獵방짎^ 11101000101010001000001011101011101000001001011111101000101111111000001011101111101001011001000011101011101001101011110111100111101001101011100111100111101110011000001111101000101010001000001011101010111111101001011111101000101111111000001011101111101001011001000011101011101001101011110111100111101001101011100111100110101000111001101001011110 e8a882eba097e8bf82efa590eba6bde7a6b9e7b983e8a882eafe97e8bf82efa590eba6bde7a6b9e6a39a5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
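
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The codec names (`cp932` for SJIS-Win, `cp949` for UHC) and the helper names `encode_row` and `convert` are assumptions chosen for illustration. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which is consistent with the runs of `3f` in the table above. Python's codec tables may differ from this tool's in edge cases.

```python
# Map the charset labels used in the table to Python codec names.
# These mappings are assumptions: cp932 for SJIS-Win, cp949 for UHC.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' (0x3f) instead of raising an error.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset; returns {label: (bits, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one row per charset, loosely mirroring the table layout.
    for label, (bits, hex_string) in convert("あ^").items():
        print(label, bits, hex_string)
```

Calling `convert` with a short string returns one (binary, hexadecimal) pair per charset, so the same input can be compared across encodings as in the table.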