What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??宜??濡??域??乳??韋??億??? 1110000110011111001111110011111110001011010110000011111100111111100101000100011100111111001111111000100011100110001111110011111110010011111110110011111100111111111010001110100000111111001111111000100110101101001111110011111100111111 e19f3f3f8b583f3f94473f3f88e63f3f93fb3f3fe8e83f3f89ad3f3f3f
EUC-JP 癲??宜??濡??域??乳??韋??億??? 1110001010100001001111110011111110110101101110010011111100111111110001111010100000111111001111111011000011101000001111110011111111000110111111010011111100111111111100001110101000111111001111111011001010101111001111110011111100111111 e2a13f3fb5b93f3fc7a83f3fb0e83f3fc6fd3f3ff0ea3f3fb2af3f3f3f
UTF-8 癲덈챶宜룝슭濡녈렊域뱀쉸乳면쪛韋몃쳷億됯쐼吏 111001111001100110110010111010111000110110001000111011001011000110110110111001011010111010011100111010111010001110011101111011001000101010101101111001101011111110100001111010111000010110001000111010111010000010001010111001011001111110011111111010111011000110000000111011001000100110111000111001001011100110110011111010111010100110110100111011001010101010011011111010011001111110001011111010111010101010000011111011001011001110110111111001011000010010000100111010111001000010101111111011001001000010111100111011111010011110011110 e799b2eb8d88ecb1b6e5ae9ceba39dec8aade6bfa1eb8588eba08ae59f9febb180ec89b8e4b9b3eba9b4ecaa9be99f8bebaa83ecb3b7e58484eb90afec90bcefa79e
UHC 癲덈챶宜룝슭濡녈렊域뱀쉸乳면쪛韋몃쳷億됯쐼吏 1110111110100110100010001110101110101010100000111110101111110001101101111110010010111101101111101110101110100001101100111110001110001110101000011110011010110100101110011110110010011010100011101110101011100001101110001110100110100101100101001110101011011111101110001110101110101011100110101110010111100010100010011110101010111110101000101110110010100111 efa688ebaa83ebf1b7e4bdbeeba1b3e38ea1e6b4b9ec9a8eeae1b8e9a594eadfb8ebab9ae5e289eabea2eca7

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
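
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the page's own implementation: the mapping from the page's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) and the helper name `encode_table` are assumptions made for illustration, and Python's codecs may differ from the page's converter for some characters. Characters that a charset cannot represent are replaced with "?" (byte 3f), which appears consistent with the 3f bytes in the table.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset label, binary bit string, hex string) for each charset."""
    rows = []
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("あ"):
        print(label, bits, hex_string)
```

For the single character "あ", this sketch yields e38182 for UTF-8, 82a0 for SJIS-WIN, a4a2 for EUC-JP, and 3f for ISO-8859-1, since ISO-8859-1 has no hiragana.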