To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 竣?ε雨?俎?????爰?茁?彫?衣? 100011110111011000111111100000111100001110001001010010100011111110011000110101110011111100111111001111110011111100111111111000001010011100111111111110111001001100111111100100101010010000111111100010001101111100111111 8f763f83c3894a3f98d73f3f3f3f3fe0a73ffb933f92a43f88df3f
EUC-JP 竣?ε雨?俎?????爰?茁?彫?衣? 10111101110101110011111110100110110001011011000110101011001111111101000011011001001111110011111100111111001111110011111111100000101010010011111110001111110101111101111000111111110001001010011000111111101100001110000100111111 bdd73fa6c5b1ab3fd0d93f3f3f3f3fe0a93f8fd7de3fc4a63fb0e13f
UTF-8 竣ㅸε雨렋俎닺렓쿰렰렜爰렪茁렭彫렪衣렟 1110011110101011101000111110001110000101101110001100111010110101111010011001101110101000111010111010000010001011111001001011111110001110111010111000101110111010111010111010000010010011111011001011111110110000111010111010000010110000111010111010000010011100111001111000100010110000111010111010000010101010111010001000110010000001111010111010000010101101111001011011110110101011111010111010000010101010111010001010000110100011111010111010000010011111 e7aba3e385b8ceb5e99ba8eba08be4bf8eeb8bbaeba093ecbfb0eba0b0eba09ce788b0eba0aae88c81eba0ade5bdabeba0aae8a1a3eba09f
UHC 竣ㅸε雨렋俎닺렓쿰렰렜爰렪茁렭彫렪衣렟 1111000111100010101001001110100010100101111001011110100111101011100011101010001011110000101110111011010011101000100011101010100011000100111100011000111010111101100011101010111011101010101110101000111010111000111100011110100010001110101110101111000011000001100011101011100011101011111111011000111010110000 f1e2a4e8a5e5e9eb8ea2f0bbb4e88ea8c4f18ebd8eaeeaba8eb8f1e88ebaf0c18eb8ebfd8eb0

SJIS-Win, EUC-JP: Classic character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
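
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function name encode_in_charsets and the mapping of the table's charset labels to Python codec names (SJIS-WIN to cp932, UHC to cp949) are assumptions, and Python's codecs may differ from the page's converter for a few characters. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to be what the table does as well.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: treated as Windows code page 932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: treated as Windows code page 949
}


def encode_in_charsets(text):
    """Return {charset label: (binary bit string, hexadecimal string)} for text."""
    result = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        result[label] = (bits, data.hex())
    return result


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_in_charsets("竣").items():
        print(label, bits, hex_string)
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column.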