To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???竊??諭??嚴щ????恂レ?筌λ?弛 00111111001111110011111111100010100001100011111100111111100101110100000000111111001111111001101010001110100001001000101100111111001111110011111100111111100111001001011010000011100011000011111111100010101000111000001111001001001111111001001001101111 3f3f3fe2863f3f97403f3f9a8e848b3f3f3f3f9c96838c3fe2a383c93f926f
EUC-JP ???竊??諭??嚴щ????恂レ?筌λ?弛 00111111001111110011111111100011111001100011111100111111110011011010000100111111001111111101001111101110101001111110101100111111001111110011111100111111110101111111011010100101111011000011111111100100101001011010011011001011001111111100001111010000 3f3f3fe3e63f3fcda13f3fd3eea7eb3f3f3f3fd7f6a5ec3fe4a5a6cb3fc3d0
UTF-8 捻뀁뮆竊섇츦諭꾩춲嚴щ쵎劉뀐쫳恂レ탮筌λ맮弛 11101111101001101010010011101011100000001000000111101011101011101000011011100111101010111000101011101100100001001000011111101100101110001010011011101000101010111010110111101010101111101010100111101100101101101011001011100101100110101011010011010001100010011110110010110101100011101110111110100111100001111110101110000000100100001110110010101011101100111110011010000001100000101110001110000011101011001110110110000011101011101110011110101101100011001100111010111011111010111010011110101110111001011011110010011011 efa6a4eb8081ebae86e7ab8aec8487ecb8a6e8abadeabea9ecb6b2e59ab4d189ecb58eefa787eb8090ecabb3e68182e383aced83aee7ad8ccebbeba7aee5bc9b
UHC 捻뀁뮆竊섇츦諭꾩춲嚴щ쵎劉뀐쫳恂レ탮筌λ맮弛 1110011011110111101100101110110010010010100101011110111110111100100110001110010110101110100111001110101110110001100001001110110010101101100011101110010111110001101011001110101110101100100100001110101011100101101100101110111110100110100010111110001011100001101010111110110010110101100011101110111110100111101001011110101110010000101101011110110010101100 e6f7b2ec9295efbc98e5ae9cebb184ecad8ee5f1acebac90eae5b2efa68be2e1abecb58eefa7a5eb90b5ecac

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)