Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 厄μ????宥??勇?Э厄μ????宥??勇?ЭB 1001011011101111100000111100101000111111001111110011111100111111100101110100011100111111001111111001011101000101001111111000010001011110100101101110111110000011110010100011111100111111001111110011111110010111010001110011111100111111100101110100010100111111100001000101111001000010 96ef83ca3f3f3f3f97473f3f97453f845e96ef83ca3f3f3f3f97473f3f97453f845e42
EUC-JP 厄μ?堉??宥??勇?Э厄μ?堉??宥??勇?ЭB 110011001111000110100110110011000011111110001111101101111111110100111111001111111100110110101000001111110011111111001101101001100011111110100111101111111100110011110001101001101100110000111111100011111011011111111101001111110011111111001101101010000011111100111111110011011010011000111111101001111011111101000010 ccf1a6cc3f8fb7fd3f3fcda83f3fcda63fa7bfccf1a6cc3f8fb7fd3f3fcda83f3fcda63fa7bf42
UTF-8 厄μ쥙堉득뿬宥룹쭍勇싳Э厄μ쥙堉득뿬宥룹쭍勇싳ЭB 111001011000111010000100110011101011110011101100101001011001100111100101101000001000100111101011100100111001110111101011101111111010110011100101101011101010010111101011101000111011100111101100101011011000110111100101100010111000011111101100100010111011001111010000101011011110010110001110100001001100111010111100111011001010010110011001111001011010000010001001111010111001001110011101111010111011111110101100111001011010111010100101111010111010001110111001111011001010110110001101111001011000101110000111111011001000101110110011110100001010110101000010 e58e84cebceca599e5a089eb939debbface5aea5eba3b9ecad8de58b87ec8bb3d0ade58e84cebceca599e5a089eb939debbface5aea5eba3b9ecad8de58b87ec8bb3d0ad42
UHC 厄μ쥙堉득뿬宥룹쭍勇싳Э厄μ쥙堉득뿬宥룹쭍勇싳ЭB 11100100111110001010010111101100101000101000111011101011101111001011010111100110100101111010110011101010111010011011011111101100101001111000011011101001101110001001101011101100101011001011111111100100111110001010010111101100101000101000111011101011101111001011010111100110100101111010110011101010111010011011011111101100101001111000011011101001101110001001101011101100101011001011111101000010 e4f8a5eca28eebbcb5e697aceae9b7eca786e9b89aecacbfe4f8a5eca28eebbcb5e697aceae9b7eca786e9b89aecacbf42

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
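
The same kind of table can be approximated offline. The following is a minimal Python sketch, not the converter's actual implementation. The function name `encode_table` and the `CHARSETS` mapping are hypothetical, and the Python codec names are assumed equivalents of the labels above (`cp932` for SJIS-WIN, `cp949` for UHC). Characters a charset cannot represent are replaced with "?" (0x3f), which is consistent with the runs of 3f in the table above.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return {label: (binary string, hex string)} for each charset."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows[label] = (bits, data.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hexstr) in encode_table("厄μB").items():
        print(label, bits, hexstr)
```

For the input "厄μB", the hexadecimal column for SJIS-WIN, EUC-JP, and UTF-8 should begin with the same bytes as the corresponding rows above (96ef83ca, ccf1a6cc, and e58e84cebc), followed by 42 for "B".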