To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ??悠ィ?悠ァ?岩??悠ィ?悠ァ?閼^ 0011111100111111100101110100100110000011010000100011111110010111010010011000001101000000001111111000101011100010001111110011111110010111010010011000001101000010001111111001011101001001100000110100000000111111111010001000010001011110 3f3f974983423f974983403f8ae23f3f974983423f974983403fe8845e
EUC-JP ??悠ィ?悠ァ?岩??悠ィ?悠ァ?閼^ 0011111100111111110011011010101010100101101000110011111111001101101010101010010110100001001111111011010011100100001111110011111111001101101010101010010110100011001111111100110110101010101001011010000100111111111011111110010001011110 3f3fcdaaa5a33fcdaaa5a13fb4e43f3fcdaaa5a33fcdaaa5a13fefe45e
UTF-8 룶끝悠ィ룫悠ァ룶岩룶끝悠ィ룫悠ァ룶閼^ 11101011101000111011011011101011100000011001110111100110100000101010000011100011100000101010001111101011101000111010101111100110100000101010000011100011100000101010000111101011101000111011011011100101101100101010100111101011101000111011011011101011100000011001110111100110100000101010000011100011100000101010001111101011101000111010101111100110100000101010000011100011100000101010000111101011101000111011011011101001100101101011110001011110 eba3b6eb819de682a0e382a3eba3abe682a0e382a1eba3b6e5b2a9eba3b6eb819de682a0e382a3eba3abe682a0e382a1eba3b6e996bc5e
UHC 룶끝悠ィ룫悠ァ룶岩룶끝悠ィ룫悠ァ룶閼^ 10001111101010111011001110100001111010101110110110101011101000111000111110100010111010101110110110101011101000011000111110101011111001001101101110001111101010111011001110100001111010101110110110101011101000111000111110100010111010101110110110101011101000011000111110101011111001001101100101011110 8fabb3a1eaedaba38fa2eaedaba18fabe4db8fabb3a1eaedaba38fa2eaedaba18fabe4d95e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)