To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???猿??耶?????恂??音??喩??B 001111110011111100111111100010011000111000111111001111111001011011101011001111110011111100111111001111110011111110011100100101100011111100111111100010011011100100111111001111111001101001100111001111110011111101000010 3f3f3f898e3f3f96eb3f3f3f3f3f9c963f3f89b93f3f9a673f3f42
EUC-JP ???猿??耶?????恂??音??喩??B 001111110011111100111111101100011110111000111111001111111100110011101101001111110011111100111111001111110011111111010111111101100011111100111111101100101011101100111111001111111101001111001000001111110011111101000010 3f3f3fb1ee3f3fcced3f3f3f3f3fd7f63f3fb2bb3f3fd3c83f3f42
UTF-8 捻꿔깙猿딆뇚耶껊돃劉㎫솾恂묎퍓音끻춢喩뽯눉B 11101111101001101010010011101010101111111001010011101010101110011001100111100111100011001011111111101011100101001000011011101011100001111001101011101000100000001011011011101010101110111000101011101011100011111000001111101111101001111000011111100011100011101010101111101100100001101011111011100110100000011000001011101011101011001000111011101101100011011001001111101001100111111011001111101011100000011011101111101100101101101010001011100101100101101010100111101011101111011010111111101011100010001000100101000010 efa6a4eabf94eab999e78cbfeb9486eb879ae880b6eabb8aeb8f83efa787e38eabec86bee68182ebac8eed8d93e99fb3eb81bbecb6a2e596a9ebbdafeb888942
UHC 捻꿔깙猿딆뇚耶껊돃劉㎫솾恂묎퍓音끻춢喩뽯눉B 11100110111101111011001011100011100000111001000111101010101110111000101011101100100001111000010111100101101011011000001111101011100010011001011011101010111001011010011111100111100110011011001011100010111000011001000111101010101110111000101011101011111001011000010111100101101011011000001111101010111001111001011011101011100001111010011101000010 e6f7b2e38391eabb8aec8785e5ad83eb8996eae5a7e799b2e2e191eabb8aebe585e5ad83eae796eb87a742

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)