To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???與??嗚??絶??厓??節?????^ 001111110011111100111111111001000110111100111111001111111001101001101010001111110011111110010000111000100011111100111111111110101000110100111111001111111001000011011111001111110011111100111111001111110011111101011110 3f3f3fe46f3f3f9a6a3f3f90e23f3ffa8d3f3f90df3f3f3f3f3f5e
EUC-JP ???與??嗚??絶??厓??節??渶??^ 001111110011111100111111111001111101000000111111001111111101001111001011001111110011111111000000111001000011111100111111100011111011010011000111001111110011111111000000111000010011111100111111100011111100011111101101001111110011111101011110 3f3f3fe7d03f3fd3cb3f3fc0e43f3f8fb4c73f3fc0e13f3f8fc7ed3f3f5e
UTF-8 欌붷뤀與잞쉼嗚밭퇅絶욐뜑厓길쯃節계뮰渶뽳풙^ 11100110101011001000110011101011101101101011011111101011101001001000000011101000100010001000011111101100100111101001111011101100100010011011110011100101100101111001101011101011101100001010110111101101100001111000010111100111101101011011011011101100100110101001000011101011100111001001000111100101100011101001001111101010101110001011100011101100101011111000001111100111101011111000000011101010101100111000010011101011101011101011000011100110101110001011011011101011101111011011001111101101100100101001100101011110 e6ac8cebb6b7eba480e88887ec9e9eec89bce5979aebb0aded8785e7b5b6ec9a90eb9c91e58e93eab8b8ecaf83e7af80eab384ebaeb0e6b8b6ebbdb3ed92995e
UHC 欌붷뤀與잞쉼嗚밭퇅絶욐뜑厓길쯃節계뮰渶뽳풙^ 11101101111010111001010011100101100011111011000111100110101010001001111111101111101111011011000011100111111100001011100111100111101101111001011011101111101111101001111011101110100011011001010011100100111011011011000111100110101010001001111111101111101111011011000011101000100100101011100111100111101101111001011011101111101111101001110001011110 edeb94e58fb1e6a89fefbdb0e7f0b9e7b796efbe9eee8d94e4edb1e6a89fefbdb0e892b9e7b796efbe9c5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)