What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 鼇??節?????鸚??鹽??^ 1110101010000111001111110011111110010000110111110011111100111111001111110011111100111111111010100101111100111111001111111110101001100100001111110011111101011110 ea873f3f90df3f3f3f3f3fea5f3f3fea643f3f5e
EUC-JP 鼇??節?????鸚??鹽??^ 1111001111100111001111110011111111000000111000010011111100111111001111110011111100111111111100111100000000111111001111111111001111000101001111110011111101011110 f3e73f3fc0e13f3f3f3f3ff3c03f3ff3c53f3f5e
UTF-8 鼇룟ㅂ節양ㅌ醴싪㉦鸚뀐슭鹽쇠넎^ 11101001101111001000011111101011101000111001111111100011100001011000001011100111101011111000000011101100100101101001000111100011100001011000110011101111101001101011011111101100100010111010101011100011100010011010011011101001101110001001101011101011100000001001000011101100100010101010110111101001101110011011110111101100100001111010000011101011100001001000111001011110 e9bc87eba39fe38582e7af80ec9691e3858cefa6b7ec8baae389a6e9b89aeb8090ec8aade9b9bdec87a0eb848e5e
UHC 鼇룟ㅂ節양ㅌ醴싪㉦鸚뀐슭鹽쇠넎^ 11101000101010001011011111100101101001001011001011101111101111011011111011100111101001001011110011100111111001001001101011101000101010001011011111100101101001001011001011101111101111011011111011100111101001001011110011101000100001101001101001011110 e8a8b7e5a4b2efbdbee7a4bce7e49ae8a8b7e5a4b2efbdbee7a4bce8869a5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
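
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. It assumes that Python's `cp932`, `euc_jp`, and `cp949` codecs correspond to the SJIS-WIN, EUC-JP, and UHC labels, and that characters a charset cannot represent are replaced with `?` (0x3f), as the table rows suggest. The names `CHARSETS`, `encode_row`, and `encode_table` are hypothetical.

```python
# Table labels mapped to Python codec names (assumed equivalents, not confirmed by the page).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the charset cannot represent.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("あ^").items():
        print(label, bits, hex_string)
```

Each byte is printed as eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal column.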