To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 壯よ?辱?ズ遙??墻??????ユ?厭 100110101110000110000010111001100011111110010000010010100011111110000011010110011110101010100001001111110011111110011010110101000011111100111111001111110011111100111111001111111000001110000110001111111000100101111101 9ae182e63f904a3f8359eaa13f3f9ad43f3f3f3f3f3f83863f897d
EUC-JP 壯よ?辱?ズ遙??墻??????ユ?厭 110101001110001110100100111010000011111110111111101010110011111110100101101110101111010010100011001111110011111111010100110101100011111100111111001111110011111100111111001111111010010111100110001111111011000111011110 d4e3a4e83fbfab3fa5baf4a33f3fd4d63f3f3f3f3f3fa5e63fb1de
UTF-8 壯よ나辱녺ズ遙쒒눈墻쇔ㅁ廉뉛풖樂ユ튇厭 111001011010001110101111111000111000001010001000111010111000001010011000111010001011111010110001111010111000010110111010111000111000001010111010111010011000000110011001111011001001001010010010111010111000100010001000111001011010001010111011111011001000011110010100111000111000010110000001111011111010011010100010111010111000100110011011111011011001001010010110111011111010011010111111111000111000001110100110111011011000101010000111111001011000111010101101 e5a3afe38288eb8298e8beb1eb85bae382bae98199ec9292eb8888e5a2bbec8794e38581efa6a2eb899bed9296efa6bfe383a6ed8a87e58ead
UHC 壯よ나辱녺ズ遙쒒눈墻쇔ㅁ廉뉛풖樂ユ튇厭 1110110111100000101010101110100010110011101010101110100110110100100001101110011110101011101110101110100110101011100111001110100110110100101010111110110111011111101111001110010110100100101100011110011011110101100001111110111110111110100110011110100011111001101010111110011010111001100111001110011011110100 ede0aae8b3aae9b486e7abbae9ab9ce9b4abeddfbce5a4b1e6f587efbe99e8f9abe6b99ce6f4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)