To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 壓??踰??衰??嚥〓?悠?い怨j?鵝 1001101011011000001111110011111111100110111110100011111100111111100100001000101000111111001111111001101010001011100000011010110000111111100101110100100100111111100000101010001010001001100001011000001010001010001111111110101001000000 9ad83f3fe6fa3f3f908a3f3f9a8b81ac3f97493f82a28985828a3fea40
EUC-JP 壓??踰??衰??嚥〓?悠?い怨j?鵝 1101010011011010001111110011111111101100111111000011111100111111101111111110101000111111001111111101001111101011101000101010111000111111110011011010101000111111101001001010010010110001111001011010001111101010001111111111001110100001 d4da3f3fecfc3f3fbfea3f3fd3eba2ae3fcdaa3fa4a4b1e5a3ea3ff3a1
UTF-8 壓쇰낄踰ⓩ끽衰⑹넯嚥〓끃悠곮い怨j퐩鵝 111001011010001110010011111011001000011110110000111010111000001010000100111010001011100010110000111000101001001110101001111010111000000110111101111010001010000110110000111000101001000110111001111010111000010010101111111001011001101010100101111000111000000010010011111010111000000110000011111001101000001010100000111010101011001110101110111000111000000110000100111001101000000010101000111011111011110110001010111011011001000010101001111010011011010110011101 e5a393ec87b0eb8284e8b8b0e293a9eb81bde8a1b0e291b9eb84afe59aa5e38093eb8183e682a0eab3aee38184e680a8efbd8aed90a9e9b59d
UHC 壓쇰낄踰ⓩ끽衰⑹넯嚥〓끃悠곮い怨j퐩鵝 1110010011100010101111001110101110110011101001011110101110110010101010001110011010110011101000111110000111110001101010011110110010000110101011101110011010111111101000011110101110000101101110011110101011101101100000011110100010101010101001001110101010110011101000111110101010111101100100101110010010111101 e4e2bcebb3a5ebb2a8e6b3a3e1f1a9ec86aee6bfa1eb85b9eaed81e8aaa4eab3a3eabd92e4bd

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)