To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ??????濡??塋??濡??隘?????^ 0011111100111111001111110011111100111111001111111001010001000111001111110011111110011010110010000011111100111111100101000100011100111111001111111110100010100101001111110011111100111111001111110011111101011110 3f3f3f3f3f3f94473f3f9ac83f3f94473f3fe8a53f3f3f3f3f5e
EUC-JP ??????濡??塋??濡??隘?????^ 0011111100111111001111110011111100111111001111111100011110101000001111110011111111010100110010100011111100111111110001111010100000111111001111111111000010100111001111110011111100111111001111110011111101011110 3f3f3f3f3f3fc7a83f3fd4ca3f3fc7a83f3ff0a73f3f3f3f3f5e
UTF-8 溜곕줂溜녑삻濡숇졂塋딅젾濡숇졂隘뀀줂溜녕쳥^ 11101111101001111000101111101010101100111001010111101100101001001000001011101111101001111000101111101011100001011001000111101100100000101011101111100110101111111010000111101100100010001000011111101100101000011000001011100101101000011000101111101011100101001000010111101100101000001011111011100110101111111010000111101100100010001000011111101100101000011000001011101001100110101001100011101011100000001000000011101100101001001000001011101111101001111000101111101011100001011001010111101100101100111010010101011110 efa78beab395eca482efa78beb8591ec82bbe6bfa1ec8887eca182e5a18beb9485eca0bee6bfa1ec8887eca182e99a98eb8080eca482efa78beb8595ecb3a55e
UHC 溜곕줂溜녑삻濡숇졂塋딅젾濡숇졂隘뀀줂溜녕쳥^ 11101010111111101011000011101011101000011001100111101010111111101011001111100101100110001011001011101011101000011001100111101011101000001011001111100111101010111000101011101011101000001011000011101011101000011001100111101011101000001011001111100100111101101011001011101011101000011001100111101010111111101011001111100111101010111000101001011110 eafeb0eba199eafeb3e598b2eba199eba0b3e7ab8aeba0b0eba199eba0b3e4f6b2eba199eafeb3e7ab8a5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)