What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 蘖??揖??矣??誤??游??袁μ?癲??弛 10011111010100000011111100111111100101110100101100111111001111111110000111100001001111110011111110001100111010110011111100111111100111111110000000111111001111111110010111001101100000111100101000111111111000011001111100111111001111111001001001101111 9f503f3f974b3f3fe1e13f3f8ceb3f3f9fe03f3fe5cd83ca3fe19f3f3f926f
EUC-JP 蘖??揖??矣??誤??游??袁μ?癲??弛 11011101101100010011111100111111110011011010110000111111001111111110001011100011001111110011111110111000111011010011111100111111110111101110001000111111001111111110101011001111101001101100110000111111111000101010000100111111001111111100001111010000 ddb13f3fcdac3f3fe2e33f3fb8ed3f3fdee23f3feacfa6cc3fe2a13f3fc3d0
UTF-8 蘖뽰눦揖닷쮦矣묒돭誤곸옃游룟젾袁μ맠癲욧남弛 1110100010011000100101101110101110111101101100001110101110001000101001101110011010001111100101101110101110001011101101111110110010101110101001101110011110011111101000111110101110101100100100101110101110001111101011011110100010101010101001001110101010110011101110001110110010011000100000111110011010111000101110001110101110100011100111111110110010100000101111101110100010100010100000011100111010111100111010111010011110100000111001111001100110110010111011001001101010100111111010111000001010101000111001011011110010011011 e89896ebbdb0eb88a6e68f96eb8bb7ecaea6e79fa3ebac92eb8fade8aaa4eab3b8ec9883e6b8b8eba39feca0bee8a281cebceba7a0e799b2ec9aa7eb82a8e5bc9b
UHC 蘖뽰눦揖닷쮦矣묒돭誤곸옃游룟젾袁μ맠癲욧남弛 1110010111101110100101101110110010000111101111011110101111100111101101001110010110101000100000111110101111111000100100011110110010001001101100001110100010100110100000011110110010011110100011111110101011111101101101111110010110100000101100001110101010111110101001011110110010010000101011011110111110100110101111111110101010110011101100101110110010101100 e5ee96ec87bdebe7b4e5a883ebf891ec89b0e8a681ec9e8feafdb7e5a0b0eabea5ec90adefa6bfeab3b2ecac

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
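
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) and the helper names `encode_row` and `convert` are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to be how the table fills unencodable positions.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win = CP932, as noted above
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: Python's cp949 codec stands in for UHC
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    # Print one row per charset: label, binary bit string, hexadecimal bit string.
    for label, (bits, hexstr) in convert("弛").items():
        print(label, bits, hexstr)
```

For the single character 弛, this sketch yields `926f` in SJIS-WIN, `c3d0` in EUC-JP, and `e5bc9b` in UTF-8, matching the trailing bytes of the corresponding rows in the table.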