What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 鵝?????音??奄??愉??釉??筌λ?B 1110101001000000001111110011111100111111001111110011111110001001101110010011111100111111100010011000001000111111001111111001011011111001001111110011111111100111110101100011111100111111111000101010001110000011110010010011111101000010 ea403f3f3f3f3f89b93f3f89823f3f96f93f3fe7d63f3fe2a383c93f42
EUC-JP 鵝?????音??奄??愉??釉??筌λ?B 1111001110100001001111110011111100111111001111110011111110110010101110110011111100111111101100011110001000111111001111111100110011111011001111110011111111101110110110000011111100111111111001001010010110100110110010110011111101000010 f3a13f3f3f3f3fb2bb3f3fb1e23f3fccfb3f3feed83f3fe4a5a6cb3f42
UTF-8 鵝숈뮄鱗득룚音쀫옜奄몃맩愉껆븦釉먮뼊筌λ넳B 111010011011010110011101111011001000100010001000111010111010111010000100111011111010011110110010111010111001001110011101111010111010001110011010111010011001111110110011111011001000000010101011111011001001100010011100111001011010010110000100111010111010101010000011111010111010011110101001111001101000010010001001111010101011101110000110111010111011100010100110111010011000011110001001111010111010100010101110111010111011110010001010111001111010110110001100110011101011101111101011100001001011001101000010 e9b59dec8888ebae84efa7b2eb939deba39ae99fb3ec80abec989ce5a584ebaa83eba7a9e68489eabb86ebb8a6e98789eba8aeebbc8ae7ad8ccebbeb84b342
UHC 鵝숈뮄鱗득룚音쀫옜奄몃맩愉껆븦釉먮뼊筌λ넳B 11100100101111011001100111101100100100101001001111101100111001111011010111100110100011111001011011101011111001011001011111101011101111111011111111100101111100101011100011101011100100001011000111101010111100001000001111100111100101011000111111101011101110001001000011101011100101101001001011101111101001111010010111101011100001101011001001000010 e4bd99ec9293ece7b5e68f96ebe597ebbfbfe5f2b8eb90b1eaf083e7958febb890eb9692efa7a5eb86b242

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
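
The same kind of conversion can be sketched in Python. This is a minimal sketch, not the converter's own implementation: the helper name to_bits and the mapping of the table's charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC, and so on) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which is one plausible source of the runs of 3f in the table above.

```python
from typing import Tuple

# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bits(text: str, codec: str) -> Tuple[str, str]:
    """Return (binary string, hex string) for text encoded with codec.

    Hypothetical helper: unencodable characters become "?" (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


if __name__ == "__main__":
    # Print one row per charset, in the same column order as the table.
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bits("B", codec)
        print(label, "B", binary, hexadecimal)
```

For an ASCII letter such as "B", every charset listed yields the single byte 42, which matches the last byte of each row in the table.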