To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 額??鹽??熱??歪ゅ?絶??嚥??預??B 100010100111101000111111001111111110101001100100001111110011111110010100010011010011111100111111100110000110001110000010111000110011111110010000111000100011111100111111100110101000101100111111001111111001011101100001001111110011111101000010 8a7a3f3fea643f3f944d3f3f986382e33f90e23f3f9a8b3f3f97613f3f42
EUC-JP 額??鹽??熱??歪ゅ?絶??嚥??預??B 101100111101101100111111001111111111001111000101001111110011111111000111101011100011111100111111110011111100010010100100111001010011111111000000111001000011111100111111110100111110101100111111001111111100110111000010001111110011111101000010 b3db3f3ff3c53f3fc7ae3f3fcfc4a4e53fc0e43f3fd3eb3f3fcdc23f3f42
UTF-8 額댐풘鹽얏영熱썼뒡歪ゅ졃絶쏁ㅎ嚥든뮵預앶궕B 11101001101000011000110111101011100011001001000011101101100100101001100011101001101110011011110111101100100101101000111111101100100110001000000111100111100001101011000111101100100011011011110011101011100100101010000111100110101011011010101011100011100000101000010111101100101000011000001111100111101101011011011011101100100011111000000111100011100001011000111011100101100110101010010111101011100100111010000011101011101011101011010111101001101000001001000011101100100101011011011011101010101101101001010101000010 e9a18deb8c90ed9298e9b9bdec968fec9881e786b1ec8dbceb92a1e6adaae38285eca183e7b5b6ec8f81e3858ee59aa5eb93a0ebaeb5e9a090ec95b6eab69542
UHC 額댐풘鹽얏영熱썼뒡歪ゅ졃絶쏁ㅎ嚥든뮵預앶궕B 11100100111111101011010011101111101111101001101111100111101001001011111011100110101111111011010111100110111100001011110111101000100010101001110111101000111000001010101011100101101000001011010011101111101111101001101111100111101001001011111011100110101111111011010111100111100100101011110111100111111010001001110111101001100000101010101001000010 e4feb4efbe9be7a4bee6bfb5e6f0bde88a9de8e0aae5a0b4efbe9be7a4bee6bfb5e792bde7e89de982aa42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)