To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 蓊頑・泌蜒頑・比サ吩瘧讌泌蜒頑・比サ竸 11100100111000011000101011100110101001011001010011100101111110001011100111100101100000111000101011100110101001011001010011100100101110111001100111100100111000011000101011100110101001011001010011100101111110001011100111100101100000111000101011100110101001011001010011100100101110111001100101011110 e4e18ae6a594e5f8b9e5838ae6a594e4bb99e4e18ae6a594e5f8b9e5838ae6a594e4bb995e
EUC-JP 蓊頑・泌?蜒頑・比サ吩瘧讌泌?蜒頑・比サ竸 11101000111000111011010011101000100011101010010111001000111001110011111111101001111000111011010011101000100011101010010111001000111001101000111010111011110100101110011011100001111010101110110010100111110010001110011100111111111010011110001110110100111010001000111010100101110010001110011010001110101110111101000110111111 e8e3b4e88ea5c8e73fe9e3b4e88ea5c8e68ebbd2e6e1eaeca7c8e73fe9e3b4e88ea5c8e68ebbd1bf
UTF-8 蓊頑・泌蜒頑・比サ吩瘧讌泌蜒頑・比サ竸 111010001001001110001010111010011010000010010001111011111011110110100101111001101011001110001100111011101001100110011000111010001001110010010010111010011010000010010001111011111011110110100101111001101010111110010100111011111011110110111011111001011001000010101001111001111001100010100111111010001010111010001100111001101011001110001100111011101001100110011000111010001001110010010010111010011010000010010001111011111011110110100101111001101010111110010100111011111011110110111011111001111010101110111000 e8938ae9a091efbda5e6b38cee9998e89c92e9a091efbda5e6af94efbdbbe590a9e798a7e8ae8ce6b38cee9998e89c92e9a091efbda5e6af94efbdbbe7abb8
UHC ?頑?泌??頑?比?吩??泌??頑?比?? 0011111111101000110101110011111111111001101100100011111100111111111010001101011100111111110111011110111100111111110111011100001100111111001111111111100110110010001111110011111111101000110101110011111111011101111011110011111100111111 3fe8d73ff9b23f3fe8d73fddef3fddc33f3ff9b23f3fe8d73fddef3f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)