To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 悟??肄??惟??嚥〓?悠??循??堊??已 10001100111001010011111100111111111000111110010100111111001111111000100011010010001111110011111110011010100010111000000110101100001111111001011101001001001111110011111110001111011110100011111100111111100110101011111100111111001111111001101111011111 8ce53f3fe3e53f3f88d23f3f9a8b81ac3f97493f3f8f7a3f3f9abf3f3f9bdf
EUC-JP 悟??肄??惟??嚥〓?悠??循??堊??已 10111000111001110011111100111111111001101110011100111111001111111011000011010100001111110011111111010011111010111010001010101110001111111100110110101010001111110011111110111101110110110011111100111111110101001100000100111111001111111101011011100001 b8e73f3fe6e73f3fb0d43f3fd3eba2ae3fcdaa3f3fbddb3f3fd4c13f3fd6e1
UTF-8 悟뽯쉴肄덃끽惟듈뵺嚥〓끃悠뽫솾循뗣뀋堊묐돍已 111001101000001010011111111010111011110110101111111011001000100110110100111010001000001010000100111010111000110110000011111010111000000110111101111001101000001110011111111010111001001110001000111010111011010110111010111001011001101010100101111000111000000010010011111010111000000110000011111001101000001010100000111010111011110110101011111011001000011010111110111001011011111010101010111010111001011110100011111010111000000010001011111001011010000010001010111010111010110010010000111010111000111110001101111001011011011110110010 e6829febbdafec89b4e88284eb8d83eb81bde6839feb9388ebb5bae59aa5e38093eb8183e682a0ebbdabec86bee5beaaeb97a3eb808be5a08aebac90eb8f8de5b7b2
UHC 悟뽯쉴肄덃끽惟듈뵺嚥〓끃悠뽫솾循뗣뀋堊묐돍已 1110011111110110100101101110101110111101101011111110110010111101100010001110011010110011101000111110101011101110101101011110001010010100101110001110011010111111101000011110101110000101101110011110101011101101100101101110011110011001101100101110001011100000100010111110001110000101100001111110010010111110100100011110101110001001100110111110110010101011 e7f696ebbdafecbd88e6b3a3eaeeb5e294b8e6bfa1eb85b9eaed96e799b2e2e08be38587e4be91eb899becab

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)