To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ?????????億???????????億??B 001111110011111100111111001111110011111100111111001111110011111100111111100010011010110100111111001111110011111100111111001111110011111100111111001111110011111100111111001111111000100110101101001111110011111101000010 3f3f3f3f3f3f3f3f3f89ad3f3f3f3f3f3f3f3f3f3f3f89ad3f3f42
EUC-JP ?????????億???????????億??B 001111110011111100111111001111110011111100111111001111110011111100111111101100101010111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111111011001010101111001111110011111101000010 3f3f3f3f3f3f3f3f3fb2af3f3f3f3f3f3f3f3f3f3f3fb2af3f3f42
UTF-8 囹덈슚栒롥첎類앺닰億됲넞囹덈슚栒롥첎類앺닰億됲넞B 11101111101001101010100111101011100011011000100011101100100010101001101011100110101000001001001011101011101000011010010111101100101100101000111011101111101001111001000011101100100101011011101011101011100010111011000011100101100001001000010011101011100100001011001011101011100001001001111011101111101001101010100111101011100011011000100011101100100010101001101011100110101000001001001011101011101000011010010111101100101100101000111011101111101001111001000011101100100101011011101011101011100010111011000011100101100001001000010011101011100100001011001011101011100001001001111001000010 efa6a9eb8d88ec8a9ae6a092eba1a5ecb28eefa790ec95baeb8bb0e58484eb90b2eb849eefa6a9eb8d88ec8a9ae6a092eba1a5ecb28eefa790ec95baeb8bb0e58484eb90b2eb849e42
UHC 囹덈슚栒롥첎類앺닰億됲넞囹덈슚栒롥첎類앺닰億됲넞B 11100111101010101000100011101011100110101010100011100010111000111000111011100101101010101001101111101011101110101001110111101101100010001010011011100101111000101000100111101101100001101010001011100111101010101000100011101011100110101010100011100010111000111000111011100101101010101001101111101011101110101001110111101101100010001010011011100101111000101000100111101101100001101010001001000010 e7aa88eb9aa8e2e38ee5aa9bebba9ded88a6e5e289ed86a2e7aa88eb9aa8e2e38ee5aa9bebba9ded88a6e5e289ed86a242

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)