Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????h????????? 00111111001111110011111100111111001111110011111100111111001111110011111101101000001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f683f3f3f3f3f3f3f3f3f
SJIS-WIN 訝??耶?オ餓??h訝??耶?オ餓?? 111001100110001000111111001111111001011011101011001111111000001101001001100010011110110000111111001111110110100011100110011000100011111100111111100101101110101100111111100000110100100110001001111011000011111100111111 e6623f3f96eb3f834989ec3f3f68e6623f3f96eb3f834989ec3f3f
EUC-JP 訝??耶?オ餓??h訝??耶?オ餓?? 111010111100001100111111001111111100110011101101001111111010010110101010101100101110111000111111001111110110100011101011110000110011111100111111110011001110110100111111101001011010101010110010111011100011111100111111 ebc33f3fcced3fa5aab2ee3f3f68ebc33f3fcced3fa5aab2ee3f3f
UTF-8 訝밥퓱耶섊オ餓뽪쓳h訝밥퓱耶섊オ餓뽪쓳 11101000101010001001110111101011101100001010010111101101100100111011000111101000100000001011011011101100100001001000101011100011100000101010101011101001101001001001001111101011101111011010101011101100100100111011001101101000111010001010100010011101111010111011000010100101111011011001001110110001111010001000000010110110111011001000010010001010111000111000001010101010111010011010010010010011111010111011110110101010111011001001001110110011 e8a89debb0a5ed93b1e880b6ec848ae382aae9a493ebbdaaec93b368e8a89debb0a5ed93b1e880b6ec848ae382aae9a493ebbdaaec93b3
UHC 訝밥퓱耶섊オ餓뽪쓳h訝밥퓱耶섊オ餓뽪쓳 11100100101110001011100111100100101111111001011111100101101011011001100011100111101010111010101011100100101110111001011011100110100111011001000101101000111001001011100010111001111001001011111110010111111001011010110110011000111001111010101110101010111001001011101110010110111001101001110110010001 e4b8b9e4bf97e5ad98e7abaae4bb96e69d9168e4b8b9e4bf97e5ad98e7abaae4bb96e69d91

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
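
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, as are the helper names `to_bitstrings` and `convert`. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which is one plausible explanation for the runs of `3f` in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bitstrings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes '?' (0x3f) for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Return {charset label: (binary, hexadecimal)} for every charset above."""
    return {name: to_bitstrings(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one row per charset, in the same column order as the table.
    for name, (binary, hexadecimal) in convert("あh").items():
        print(name, binary, hexadecimal)
```

For example, the ASCII letter `h` encodes to the single byte `68` (binary `01101000`) in all five charsets, which is why `68` appears in the middle of every hexadecimal string in the table.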