What bit string is a character (or a short string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ??終??咫?い?挺??終??咫?い?挺B 0011111100111111100011110100100100111111001111111001101001000000001111111000001010100010001111111001001011110000001111110011111110001111010010010011111100111111100110100100000000111111100000101010001000111111100100101111000001000010 3f3f8f493f3f9a403f82a23f92f03f3f8f493f3f9a403f82a23f92f042
EUC-JP ??終??咫?い?挺??終??咫?い?挺B 0011111100111111101111011010101000111111001111111101001110100001001111111010010010100100001111111100010011110010001111110011111110111101101010100011111100111111110100111010000100111111101001001010010000111111110001001111001001000010 3f3fbdaa3f3fd3a13fa4a43fc4f23f3fbdaa3f3fd3a13fa4a43fc4f242
UTF-8 룶끝終룶웩咫춳い룫挺룶끝終룶웩咫춳い룫挺B 11101011101000111011011011101011100000011001110111100111101101011000001011101011101000111011011011101100100110111010100111100101100100101010101111101100101101101011001111100011100000011000010011101011101000111010101111100110100011001011101011101011101000111011011011101011100000011001110111100111101101011000001011101011101000111011011011101100100110111010100111100101100100101010101111101100101101101011001111100011100000011000010011101011101000111010101111100110100011001011101001000010 eba3b6eb819de7b582eba3b6ec9ba9e592abecb6b3e38184eba3abe68cbaeba3b6eb819de7b582eba3b6ec9ba9e592abecb6b3e38184eba3abe68cba42
UHC 룶끝終룶웩咫춳い룫挺룶끝終룶웩咫춳い룫挺B 1000111110101011101100111010000111110000111110111000111110101011110000001010000111110010101000011010110110001111101010101010010010001111101000101110111111011000100011111010101110110011101000011111000011111011100011111010101111000000101000011111001010100001101011011000111110101010101001001000111110100010111011111101100001000010 8fabb3a1f0fb8fabc0a1f2a1ad8faaa48fa2efd88fabb3a1f0fb8fabc0a1f2a1ad8faaa48fa2efd842
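A table like the one above can be approximated with a short Python sketch. This is an illustration under stated assumptions, not the code behind this page: the function name `encode_rows` and the mapping from the page's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are hypothetical choices, and Python's codecs may differ from the page's converter for some characters. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to be what produces the runs of `3f` in the rows above.

```python
# Assumed mapping from the page's charset labels to Python codec names.
# These names are an illustration; the page's own converter may differ.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_rows(text):
    """Return one (charset, character, binary, hex) tuple per charset."""
    rows = []
    for label, codec in CODECS.items():
        # errors="replace" substitutes b"?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        # Render every byte as eight binary digits, with no separators.
        bits = "".join(f"{byte:08b}" for byte in data)
        # Decode again to show what survives the round trip.
        rows.append((label, data.decode(codec), bits, data.hex()))
    return rows
```

Calling `encode_rows("いB")` gives, for example, the hex string `e3818442` for UTF-8 and `3f42` for ISO-8859-1, where the hiragana character has been replaced by `?`.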

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).