To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ??兵????箏???兵????箏?^ 0011111100111111100101011011101000111111001111110011111100111111111000101011010100111111001111110011111110010101101110100011111100111111001111110011111111100010101101010011111101011110 3f3f95ba3f3f3f3fe2b53f3f3f95ba3f3f3f3fe2b53f5e
EUC-JP 邕杻兵焌???箏?邕杻兵焌???箏?^ 1000111111100001111011011000111111000011101001101100101010111100100011111100100111101000001111110011111100111111111001001011011100111111100011111110000111101101100011111100001110100110110010101011110010001111110010011110100000111111001111110011111111100100101101110011111101011110 8fe1ed8fc3a6cabc8fc9e83f3f3fe4b73f8fe1ed8fc3a6cabc8fc9e83f3f3fe4b73f5e
UTF-8 邕杻兵焌ㅹ렪렊箏급邕杻兵焌ㅹ렪렊箏긁^ 11101001100000101001010111100110100111011011101111100101100001011011010111100111100001001000110011100011100001011011100111101011101000001010101011101011101000001000101011100111101011101000111111101010101110001000100111101001100000101001010111100110100111011011101111100101100001011011010111100111100001001000110011100011100001011011100111101011101000001010101011101011101000001000101011100111101011101000111111101010101110001000000101011110 e98295e69dbbe585b5e7848ce385b9eba0aaeba08ae7ae8feab889e98295e69dbbe585b5e7848ce385b9eba0aaeba08ae7ae8feab8815e
UHC 邕杻兵焌ㅹ렪렊箏급邕杻兵焌ㅹ렪렊箏긁^ 11101000101110111101001011101110110111001011001011110001111000001010010011101001100011101011100010001110101000011110111010110100101100011101111011101000101110111101001011101110110111001011001011110001111000001010010011101001100011101011100010001110101000011110111010110100101100011101110001011110 e8bbd2eedcb2f1e0a4e98eb88ea1eeb4b1dee8bbd2eedcb2f1e0a4e98eb88ea1eeb4b1dc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)