To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ?脹??牢珪?厦雰?脹??牢珪?厦雰B 0011111110010010101011110011111100111111100110000101001110001100010111010011111110011001110011011001010110110101001111111001001010101111001111110011111110011000010100111000110001011101001111111001100111001101100101011011010101000010 3f92af3f3f98538c5d3f99cd95b53f92af3f3f98538c5d3f99cd95b542
EUC-JP ?脹??牢珪?厦雰?脹??牢珪?厦雰B 0011111111000100101100010011111100111111110011111011010010110111101111100011111111010010110011111100101010110111001111111100010010110001001111110011111111001111101101001011011110111110001111111101001011001111110010101011011101000010 3fc4b13f3fcfb4b7be3fd2cfcab73fc4b13f3fcfb4b7be3fd2cfcab742
UTF-8 뤋脹쫸샘牢珪뤋厦雰뤋脹쫸샘牢珪뤋厦雰B 11101011101001001000101111101000100001001011100111101100101010111011100011101100100000111001100011100111100010011010001011100111100011111010101011101011101001001000101111100101100011101010011011101001100110111011000011101011101001001000101111101000100001001011100111101100101010111011100011101100100000111001100011100111100010011010001011100111100011111010101011101011101001001000101111100101100011101010011011101001100110111011000001000010 eba48be884b9ecabb8ec8398e789a2e78faaeba48be58ea6e99bb0eba48be884b9ecabb8ec8398e789a2e78faaeba48be58ea6e99bb042
UHC 뤋脹쫸샘牢珪뤋厦雰뤋脹쫸샘牢珪뤋厦雰B 10001111101110111111001111101100101001101000111110111011111110011101011011101111110100001010100010001111101110111111100110111101110111011101010010001111101110111111001111101100101001101000111110111011111110011101011011101111110100001010100010001111101110111111100110111101110111011101010001000010 8fbbf3eca68fbbf9d6efd0a88fbbf9bdddd48fbbf3eca68fbbf9d6efd0a88fbbf9bdddd442

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)