To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 瓦??汚??焉??餘??節??狎??汚??^ 1000101010100010001111110011111110001001100110000011111100111111111000001000000100111111001111111110100101010000001111110011111110010000110111110011111100111111111000001011111000111111001111111000100110011000001111110011111101011110 8aa23f3f89983f3fe0813f3fe9503f3f90df3f3fe0be3f3f89983f3f5e
EUC-JP 瓦??汚??焉??餘??節??狎??汚??^ 1011010010100100001111110011111110110001111110000011111100111111110111111110000100111111001111111111000110110001001111110011111111000000111000010011111100111111111000001100000000111111001111111011000111111000001111110011111101011110 b4a43f3fb1f83f3fdfe13f3ff1b13f3fc0e13f3fe0c03f3fb1f83f3f5e
UTF-8 瓦븝쉑汚믣넇焉묋윊餘됭옌節㎬윊狎쀦뙌汚믤뿬^ 11100111100100111010011011101011101110001001110111101100100010011001000111100110101100011001101011101011101011111010001111101011100001001000011111100111100001001000100111101011101011001000101111101100100111001000101011101001101001001001100011101011100100001010110111101100100110001000110011100111101011111000000011100011100011101010110011101100100111001000101011100111100010111000111011101100100000001010011011101011100110011000110011100110101100011001101011101011101011111010010011101011101111111010110001011110 e793a6ebb89dec8991e6b19aebafa3eb8487e78489ebac8bec9c8ae9a498eb90adec988ce7af80e38eacec9c8ae78b8eec80a6eb998ce6b19aebafa4ebbfac5e
UHC 瓦븝쉑汚믣넇焉묋윊餘됭옌節㎬윊狎쀦뙌汚믤뿬^ 11101000101111111011101011101111101111011010011111100111111111011001001011100101100001101001011111100101111010101001000111101000100111111001001011100110101011101000100111101000101111111011101011101111101111011010011111101000100111111001001011100100111001001001011111100110100011001001000111100111111111011001001011100110100101111010110001011110 e8bfbaefbda7e7fd92e58697e5ea91e89f92e6ae89e8bfbaefbda7e89f92e4e497e68c91e7fd92e697ac5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)