To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 躍??毅??猿??[躍??毅??猿??[^ 100101101111010000111111001111111000101101000010001111110011111110001001100011100011111100111111010110111001011011110100001111110011111110001011010000100011111100111111100010011000111000111111001111110101101101011110 96f43f3f8b423f3f898e3f3f5b96f43f3f8b423f3f898e3f3f5b5e
EUC-JP 躍??毅??猿??[躍??毅??猿??[^ 110011001111011000111111001111111011010110100011001111110011111110110001111011100011111100111111010110111100110011110110001111110011111110110101101000110011111100111111101100011110111000111111001111110101101101011110 ccf63f3fb5a33f3fb1ee3f3f5bccf63f3fb5a33f3fb1ee3f3f5b5e
UTF-8 躍녠퍏毅덂립猿놁삁[躍녠퍏毅덂립猿놁삁[^ 111010001011101010001101111010111000010110100000111011011000110110001111111001101010111110000101111010111000110110000010111010111010011010111101111001111000110010111111111010111000011010000001111011001000001010000001010110111110100010111010100011011110101110000101101000001110110110001101100011111110011010101111100001011110101110001101100000101110101110100110101111011110011110001100101111111110101110000110100000011110110010000010100000010101101101011110 e8ba8deb85a0ed8d8fe6af85eb8d82eba6bde78cbfeb8681ec82815be8ba8deb85a0ed8d8fe6af85eb8d82eba6bde78cbfeb8681ec82815b5e
UHC 躍녠퍏毅덂립猿놁삁[躍녠퍏毅덂립猿놁삁[^ 111001011011100010110011111010101011101110000110111010111111011010001000111001011011100010110011111010101011101110000110111011001001100010001000010110111110010110111000101100111110101010111011100001101110101111110110100010001110010110111000101100111110101010111011100001101110110010011000100010000101101101011110 e5b8b3eabb86ebf688e5b8b3eabb86ec98885be5b8b3eabb86ebf688e5b8b3eabb86ec98885b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)