Which bit string is a character (or string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 凹∽?撓??絶??庄??絶??節ワ?晤ゎ?^ 1000100110011010100000011110010000111111100111011001101000111111001111111001000011100010001111110011111110001111101011110011111100111111100100001110001000111111001111111001000011011111100000111000111100111111100111011110101110000010111011000011111101011110 899a81e43f9d9a3f3f90e23f3f8faf3f3f90e23f3f90df838f3f9deb82ec3f5e
EUC-JP 凹∽?撓??絶??庄??絶??節ワ?晤ゎ?^ 1011000111111010101000101110011000111111110110011111101000111111001111111100000011100100001111110011111110111110101100010011111100111111110000001110010000111111001111111100000011100001101001011110111100111111110110101110110110100100111011100011111101011110 b1faa2e63fd9fa3f3fc0e43f3fbeb13f3fc0e43f3fc0e1a5ef3fdaeda4ee3f5e
UTF-8 凹∽푺撓뷂쉥絶뽬씕庄볣뙜絶녽뿏節ワ풒晤ゎ넃^ 11100101100001111011100111100010100010001011110111101101100100011011101011100110100100101001001111101011101101111000001011101100100010011010010111100111101101011011011011101011101111011010110011101100100101001001010111100101101110101000010011101011101100111010001111101011100110011001110011100111101101011011011011101011100001011011110111101011101111111000111111100111101011111000000011100011100000111010111111101101100100101001001011100110100110011010010011100011100000101000111011101011100001001000001101011110 e587b9e288bded91bae69293ebb782ec89a5e7b5b6ebbdacec9495e5ba84ebb3a3eb999ce7b5b6eb85bdebbf8fe7af80e383afed9292e699a4e3828eeb84835e
UHC 凹∽푺撓뷂쉥絶뽬씕庄볣뙜絶녽뿏節ワ풒晤ゎ넃^ 11101000111010101010000111101111101111101000011011101000111101011001010011101111101111011010101111101111101111101001011011101000100111011010101011101101111001001001001111101001100011001010000111101111101111101000011011101001100101111001010011101111101111011010101111101111101111101001011011100111111110111010101011101110100001101001001101011110 e8eaa1efbe86e8f594efbdabefbe96e89daaede493e98ca1efbe86e99794efbdabefbe96e7fbaaee86935e

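SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A byte of 3f ("?") in the output marks a character that the target charset cannot represent.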
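
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function name `encode_table` and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions made for illustration. Unencodable characters are replaced with "?" (3f), which mirrors the behavior seen in the ISO-8859-1 row. Python's codecs may differ from this page's converter for some vendor-specific characters.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        raw = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, binary, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, binary, hexadecimal in encode_table("\u51f9"):  # the character 凹
        print(label, binary, hexadecimal)
```

For the single character 凹, this sketch yields e587b9 in UTF-8, 899a in SJIS-WIN, and b1fa in EUC-JP, matching the leading bytes of the corresponding rows above.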