To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ????裔??釗??碎??????倭?? 001111110011111100111111001111111110010111100001001111110011111111111011101110110011111100111111111000011110101000111111001111110011111100111111001111110011111110011000011000000011111100111111 3f3f3f3fe5e13f3ffbbb3f3fe1ea3f3f3f3f3f3f98603f3f
EUC-JP ????裔??釗??碎??????倭?? 00111111001111110011111100111111111010101110001100111111001111111000111111100011101001100011111100111111111000101110110000111111001111110011111100111111001111110011111111001111110000010011111100111111 3f3f3f3feae33f3f8fe3a63f3fe2ec3f3f3f3f3f3fcfc13f3f
UTF-8 捻꽝살돪裔꾨뵃釗섓쭜碎몃룑捻꽝살돇倭얩뒑 111011111010011010100100111010101011110110011101111011001000001010110100111010111000111110101010111010001010001110010100111010101011111010101000111010111011010110000011111010011000011110010111111011001000010010010011111011001010110110011100111001111010001010001110111010111010101010000011111010111010001110010001111011111010011010100100111010101011110110011101111011001000001010110100111010111000111110000111111001011000000010101101111011001001011010101001111010111001001010010001 efa6a4eabd9dec82b4eb8faae8a394eabea8ebb583e98797ec8493ecad9ce7a28eebaa83eba391efa6a4eabd9dec82b4eb8f87e580adec96a9eb9291
UHC 捻꽝살돪裔꾨뵃釗섓쭜碎몃룑捻꽝살돇倭얩뒑 11100110111101111011001011001110101110111110110010001001101011011110011111100000100001001110101110010100100010011110000111110010100110001110111110100111100100101110000111101111101110001110101110001111100011101110011011110111101100101100111010111011111011001000100110011000111010001101111010111110111011011000101010001110 e6f7b2cebbec89ade7e084eb9489e1f298efa792e1efb8eb8f8ee6f7b2cebbec8998e8debeed8a8e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)