To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 哀??節??純??悟??哀??節??純??悟??^ 100010001010001100111111001111111001000011011111001111110011111110001111100000110011111100111111100011001110010100111111001111111000100010100011001111110011111110010000110111110011111100111111100011111000001100111111001111111000110011100101001111110011111101011110 88a33f3f90df3f3f8f833f3f8ce53f3f88a33f3f90df3f3f8f833f3f8ce53f3f5e
EUC-JP 哀??節??純?Ł悟??哀??節??純?Ł悟??^ 10110000101001010011111100111111110000001110000100111111001111111011110111100011001111111000111110101001101010001011100011100111001111110011111110110000101001010011111100111111110000001110000100111111001111111011110111100011001111111000111110101001101010001011100011100111001111110011111101011110 b0a53f3fc0e13f3fbde33f8fa9a8b8e73f3fb0a53f3fc0e13f3fbde33f8fa9a8b8e73f3f5e
UTF-8 哀잆룜節꾢맅純섏Ł悟뽬돬哀잆룜節꾢맅純섏Ł悟뽫껑^ 1110010110010011100000001110110010011110100001101110101110100011100111001110011110101111100000001110101010111110101000101110101110100111100001011110011110110100100101001110110010000100100011111100010110000001111001101000001010011111111010111011110110101100111010111000111110101100111001011001001110000000111011001001111010000110111010111010001110011100111001111010111110000000111010101011111010100010111010111010011110000101111001111011010010010100111011001000010010001111110001011000000111100110100000101001111111101011101111011010101111101010101110111001000101011110 e59380ec9e86eba39ce7af80eabea2eba785e7b494ec848fc581e6829febbdaceb8face59380ec9e86eba39ce7af80eabea2eba785e7b494ec848fc581e6829febbdabeabb915e
UHC 哀잆룜節꾢맅純섏Ł悟뽬돬哀잆룜節꾢맅純섏Ł悟뽫껑^ 11100100111011101001111111100011100011111001100011101111101111011000010011100101100100001001111111100010111011011001100011101100101010001010100111100111111101101001011011101000100010011010111111100100111011101001111111100011100011111001100011101111101111011000010011100101100100001001111111100010111011011001100011101100101010001010100111100111111101101001011011100111101100101011000101011110 e4ee9fe38f98efbd84e5909fe2ed98eca8a9e7f696e889afe4ee9fe38f98efbd84e5909fe2ed98eca8a9e7f696e7b2b15e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)