To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 俑??裕?????瑤??唯?┐誘??堊 1001100011011010001111110011111110010111010101000011111100111111001111110011111100111111111010101010001000111111001111111001011101000010001111111000010010100010100101110101010100111111001111111001101010111111 98da3f3f97543f3f3f3f3feaa23f3f97423f84a297553f3f9abf
EUC-JP 俑??裕??洧??瑤??唯?┐誘??堊 11010000110111000011111100111111110011011011010100111111001111111000111111000111101101000011111100111111111101001010010000111111001111111100110110100011001111111010100010100100110011011011011000111111001111111101010011000001 d0dc3f3fcdb53f3f8fc7b43f3ff4a43f3fcda33fa8a4cdb63f3fd4c1
UTF-8 俑앹늿裕꾤몭洧밸뎐瑤녈끇唯뉛┐誘↔틙堊 111001001011111110010001111011001001010110111001111010111000101010111111111010001010001110010101111010101011111010100100111010111010101010101101111001101011010010100111111010111011000010111000111010111000111010010000111001111001000110100100111010111000010110001000111010111000000110000111111001011001010010101111111010111000100110011011111000101001010010010000111010001010101010011000111000101000011010010100111011011000101110011001111001011010000010001010 e4bf91ec95b9eb8abfe8a395eabea4ebaaade6b4a7ebb0b8eb8e90e791a4eb8588eb8187e594afeb899be29490e8aa98e28694ed8b99e5a08a
UHC 俑앹늿裕꾤몭洧밸뎐瑤녈끇唯뉛┐誘↔틙堊 1110100110110101100111011110110010001000100010001110101110101110100001001110011110010001100101111110101011111011101110011110101110110101101011111110100011111101101100111110001110000101101110111110101011100110100001111110111110100110101001001110101110101111101000011110101010111010100001101110010010111110 e9b59dec8888ebae84e79197eafbb9ebb5afe8fdb3e385bbeae687efa6a4ebafa1eaba86e4be

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)