To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 日?精?姙?臧?逸?各日?精?姙?臧?逸?各^ 1001001111111010001111111001000010111000001111111001101101001011001111111110010001101000001111111000100011101101001111111000101001100101100100111111101000111111100100001011100000111111100110110100101100111111111001000110100000111111100010001110110100111111100010100110010101011110 93fa3f90b83f9b4b3fe4683f88ed3f8a6593fa3f90b83f9b4b3fe4683f88ed3f8a655e
EUC-JP 日?精?姙?臧?逸?各日?精?姙?臧?逸?各^ 1100011011111100001111111100000010111010001111111101010110101100001111111110011111001001001111111011000011101111001111111011001111000110110001101111110000111111110000001011101000111111110101011010110000111111111001111100100100111111101100001110111100111111101100111100011001011110 c6fc3fc0ba3fd5ac3fe7c93fb0ef3fb3c6c6fc3fc0ba3fd5ac3fe7c93fb0ef3fb3c65e
UTF-8 日렮精렖姙렔臧렎逸쇨各日렮精렖姙렔臧렎逸쇨各^ 11100110100101111010010111101011101000001010111011100111101100101011111011101011101000001001011011100101101001111001100111101011101000001001010011101000100001111010011111101011101000001000111011101001100000001011100011101100100001111010100011100101100100001000010011100110100101111010010111101011101000001010111011100111101100101011111011101011101000001001011011100101101001111001100111101011101000001001010011101000100001111010011111101011101000001000111011101001100000001011100011101100100001111010100011100101100100001000010001011110 e697a5eba0aee7b2beeba096e5a799eba094e887a7eba08ee980b8ec87a8e59084e697a5eba0aee7b2beeba096e5a799eba094e887a7eba08ee980b8ec87a8e590845e
UHC 日렮精렖姙렔臧렎逸쇨各日렮精렖姙렔臧렎逸쇨各^ 111011001110110110001110101110111110111111110001100011101010101111101100111101011000111010101001111011011111010110001110101001001110110011101111101111001110101011001010110000001110110011101101100011101011101111101111111100011000111010101011111011001111010110001110101010011110110111110101100011101010010011101100111011111011110011101010110010101100000001011110 eced8ebbeff18eabecf58ea9edf58ea4ecefbceacac0eced8ebbeff18eabecf58ea9edf58ea4ecefbceacac05e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)