To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????z?????????zB 001111110011111100111111001111110011111100111111001111110011111100111111011110100011111100111111001111110011111100111111001111110011111100111111001111110111101001000010 3f3f3f3f3f3f3f3f3f7a3f3f3f3f3f3f3f3f3f7a42
SJIS-WIN ???穩??穩э?z???穩??穩э?zB 001111110011111100111111111000100111001000111111001111111110001001110010100001001000111100111111011110100011111100111111001111111110001001110010001111110011111111100010011100101000010010001111001111110111101001000010 3f3f3fe2723f3fe272848f3f7a3f3f3fe2723f3fe272848f3f7a42
EUC-JP 獒??穩??穩э?z獒??穩??穩э?zB 10001111110010111011101100111111001111111110001111010011001111110011111111100011110100111010011111101111001111110111101010001111110010111011101100111111001111111110001111010011001111110011111111100011110100111010011111101111001111110111101001000010 8fcbbb3f3fe3d33f3fe3d3a7ef3f7a8fcbbb3f3fe3d33f3fe3d3a7ef3f7a42
UTF-8 獒뀐슭穩뚳슈穩э슛z獒뀐슭穩뚳슈穩э슛zB 11100111100011011001001011101011100000001001000011101100100010101010110111100111101010011010100111101011100110101011001111101100100010101000100011100111101010011010100111010001100011011110110010001010100110110111101011100111100011011001001011101011100000001001000011101100100010101010110111100111101010011010100111101011100110101011001111101100100010101000100011100111101010011010100111010001100011011110110010001010100110110111101001000010 e78d92eb8090ec8aade7a9a9eb9ab3ec8a88e7a9a9d18dec8a9b7ae78d92eb8090ec8aade7a9a9eb9ab3ec8a88e7a9a9d18dec8a9b7a42
UHC 獒뀐슭穩뚳슈穩э슛z獒뀐슭穩뚳슈穩э슛zB 111010001010001110110010111011111011110110111110111010001011000110001100111011111011110110110100111010001011000110101100111011111011110110111000011110101110100010100011101100101110111110111101101111101110100010110001100011001110111110111101101101001110100010110001101011001110111110111101101110000111101001000010 e8a3b2efbdbee8b18cefbdb4e8b1acefbdb87ae8a3b2efbdbee8b18cefbdb4e8b1acefbdb87a42

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP), respectively.
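
The conversion shown in the table can be approximated offline. The following is a minimal sketch, not the tool's actual implementation: the function name `encode_table` is hypothetical, and the Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumed to be the closest equivalents of the charsets listed above. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which appears to be what the table does as well.

```python
# Display names from the table mapped to Python codec names.
# The mapping is an assumption; the tool may use slightly different tables.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes '?' for unencodable characters
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("あzB"):
        print(label, bits, hex_string)
```

For pure ASCII input such as `zB`, every charset in this sketch yields the same bytes (`7a42`), which is consistent with the trailing `7a42` in each row of the table.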