Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 爭耕?箏?檣????命爭耕?箏?檣????明^ 111000001010010110001101011010110011111111100010101101010011111110011110111111000011111100111111001111110011111110010110101111011110000010100101100011010110101100111111111000101011010100111111100111101111110000111111001111110011111100111111100101101011111001011110 e0a58d6b3fe2b53f9efc3f3f3f3f96bde0a58d6b3fe2b53f9efc3f3f3f3f96be5e
EUC-JP 爭耕?箏?檣?獐??命爭耕?箏?檣?獐??明^ 11100000101001111011100111001100001111111110010010110111001111111101110011111110001111111000111111001011101110100011111100111111110011001011111111100000101001111011100111001100001111111110010010110111001111111101110011111110001111111000111111001011101110100011111100111111110011001100000001011110 e0a7b9cc3fe4b73fdcfe3f8fcbba3f3fccbfe0a7b9cc3fe4b73fdcfe3f8fcbba3f3fccc05e
UTF-8 爭耕뉵箏렊檣렡獐곈렜命爭耕뉵箏렊檣렡獐곈렜明^ 11100111100010001010110111101000100000001001010111101011100010011011010111100111101011101000111111101011101000001000101011100110101010101010001111101011101000001010000111100111100011011001000011101010101100111000100011101011101000001001110011100101100100011011110111100111100010001010110111101000100000001001010111101011100010011011010111100111101011101000111111101011101000001000101011100110101010101010001111101011101000001010000111100111100011011001000011101010101100111000100011101011101000001001110011100110100110001000111001011110 e788ade88095eb89b5e7ae8feba08ae6aaa3eba0a1e78d90eab388eba09ce591bde788ade88095eb89b5e7ae8feba08ae6aaa3eba0a1e78d90eab388eba09ce6988e5e
UHC 爭耕뉵箏렊檣렡獐곈렜命爭耕뉵箏렊檣렡獐곈렜明^ 111011101011001111001100111010011011010010111011111011101011010010001110101000011110110111101010100011101011001011101101111011111011000011101001100011101010111011011001101001001110111010110011110011001110100110110100101110111110111010110100100011101010000111101101111010101000111010110010111011011110111110110000111010011000111010101110110110011010010101011110 eeb3cce9b4bbeeb48ea1edea8eb2edefb0e98eaed9a4eeb3cce9b4bbeeb48ea1edea8eb2edefb0e98eaed9a55e

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
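
The conversion shown in the table can be approximated with a short Python sketch. This is not the page's own implementation, which is not shown here. The codec names are assumptions about which Python codecs correspond to the table's charset labels (in particular `cp932` for SJIS-WIN and `cp949` for UHC), and `encode_row` and `convert` are hypothetical helper names. Characters that a charset cannot represent are replaced with `?` (byte 0x3f), which is consistent with the `3f` bytes in the table above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 is Python's closest match
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to code page 949
}


def encode_row(text, codec):
    """Return (binary_string, hex_string) for text encoded with codec.

    Unencodable characters are replaced with '?' (0x3f).
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (binary, hexstr) in convert("明^").items():
        print(name, binary, hexstr)
```

For the input `明^`, the hexadecimal output of this sketch for ISO-8859-1, SJIS-WIN, EUC-JP, and UTF-8 ends with the same bytes as the corresponding rows of the table (`3f5e`, `96be5e`, `ccc05e`, `e6988e5e`).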