To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 蒸陌??昶拯?止?蒸陌??昶拯?止?^ 1000111111110110111010001001100100111111001111111001110111100010100111010110110100111111100011100111111000111111100011111111011011101000100110010011111100111111100111011110001010011101011011010011111110001110011111100011111101011110 8ff6e8993f3f9de29d6d3f8e7e3f8ff6e8993f3f9de29d6d3f8e7e3f5e
EUC-JP 蒸陌??昶拯?止?蒸陌??昶拯?止?^ 1011111011111000111011111111100100111111001111111101101011100100110110011100111000111111101110111101111100111111101111101111100011101111111110010011111100111111110110101110010011011001110011100011111110111011110111110011111101011110 bef8eff93f3fdae4d9ce3fbbdf3fbef8eff93f3fdae4d9ce3fbbdf3f5e
UTF-8 蒸陌렔랑昶拯렩止렜蒸陌렔랑昶拯렩止렜^ 11101000100100101011100011101001100110011000110011101011101000001001010011101011100111101001000111100110100110001011011011100110100010111010111111101011101000001010100111100110101011011010001011101011101000001001110011101000100100101011100011101001100110011000110011101011101000001001010011101011100111101001000111100110100110001011011011100110100010111010111111101011101000001010100111100110101011011010001011101011101000001001110001011110 e892b8e9998ceba094eb9e91e698b6e68bafeba0a9e6ada2eba09ce892b8e9998ceba094eb9e91e698b6e68bafeba0a9e6ada2eba09c5e
UHC 蒸陌렔랑昶拯렩止렜蒸陌렔랑昶拯렩止렜^ 11110001111110101101100011101000100011101010100110110110111110111111001111100100111100011111010110001110101101111111001010101101100011101010111011110001111110101101100011101000100011101010100110110110111110111111001111100100111100011111010110001110101101111111001010101101100011101010111001011110 f1fad8e88ea9b6fbf3e4f1f58eb7f2ad8eaef1fad8e88ea9b6fbf3e4f1f58eb7f2ad8eae5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)