What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 壤??鎰??袁ы?永??壤??鎰??袁ы?永??B 1001101011011111001111110011111111101000010011000011111100111111111001011100110110000100100011010011111110001001011010010011111100111111100110101101111100111111001111111110100001001100001111110011111111100101110011011000010010001101001111111000100101101001001111110011111101000010 9adf3f3fe84c3f3fe5cd848d3f89693f3f9adf3f3fe84c3f3fe5cd848d3f89693f3f42
EUC-JP 壤??鎰??袁ы?永??壤??鎰??袁ы?永??B 1101010011100001001111110011111111101111101011010011111100111111111010101100111110100111111011010011111110110001110010100011111100111111110101001110000100111111001111111110111110101101001111110011111111101010110011111010011111101101001111111011000111001010001111110011111101000010 d4e13f3fefad3f3feacfa7ed3fb1ca3f3fd4e13f3fefad3f3feacfa7ed3fb1ca3f3f42
UTF-8 壤깆쥜鎰숂독袁ы뭶永띠뿝壤깆쥜鎰숂독袁ы뭶永띠뿝B 1110010110100011101001001110101010111001100001101110110010100101100111001110100110001110101100001110110010001000100000101110101110001111100001011110100010100010100000011101000110001011111010111010110110110110111001101011000010111000111010111001110110100000111010111011111110011101111001011010001110100100111010101011100110000110111011001010010110011100111010011000111010110000111011001000100010000010111010111000111110000101111010001010001010000001110100011000101111101011101011011011011011100110101100001011100011101011100111011010000011101011101111111001110101000010 e5a3a4eab986eca59ce98eb0ec8882eb8f85e8a281d18bebadb6e6b0b8eb9da0ebbf9de5a3a4eab986eca59ce98eb0ec8882eb8f85e8a281d18bebadb6e6b0b8eb9da0ebbf9d42
UHC 壤깆쥜鎰숂독袁ы뭶永띠뿝壤깆쥜鎰숂독袁ы뭶永띠뿝B 11100101101111011011000111101100101000101001000111101100111100001001100111100111101101011011011011101010101111101010110011101101100100101000010111100111101101011011011011101100100101111001111111100101101111011011000111101100101000101001000111101100111100001001100111100111101101011011011011101010101111101010110011101101100100101000010111100111101101011011011011101100100101111001111101000010 e5bdb1eca291ecf099e7b5b6eabeaced9285e7b5b6ec979fe5bdb1eca291ecf099e7b5b6eabeaced9285e7b5b6ec979f42

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
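
The conversion shown in the table can be reproduced offline. The following is a minimal sketch in Python, not the tool's actual implementation. The codec names (cp932 for SJIS-WIN, cp949 for UHC) and the helper names encode_row and encode_table are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to match the 3f bytes in the table, though the tool's own replacement rules may differ.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 as the closest Python codec
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to code page 949
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in one codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in encode_table("あB").items():
        print(label, bits, hexstr)
```

Each byte is printed as eight binary digits, so the binary column is always eight times as long as the byte count and four times as long as the hexadecimal column.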