To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????I}?????????I{^ 0011111100111111001111110011111100111111001111110011111100111111001111110100100101111101001111110011111100111111001111110011111100111111001111110011111100111111010010010111101101011110 3f3f3f3f3f3f3f3f3f497d3f3f3f3f3f3f3f3f3f497b5e
SJIS-WIN ???筌?????I}???筌?????I{^ 00111111001111110011111111100010101000110011111100111111001111110011111100111111010010010111110100111111001111110011111111100010101000110011111100111111001111110011111100111111010010010111101101011110 3f3f3fe2a33f3f3f3f3f497d3f3f3fe2a33f3f3f3f3f497b5e
EUC-JP ???筌?????I}???筌?????I{^ 00111111001111110011111111100100101001010011111100111111001111110011111100111111010010010111110100111111001111110011111111100100101001010011111100111111001111110011111100111111010010010111101101011110 3f3f3fe4a53f3f3f3f3f497d3f3f3fe4a53f3f3f3f3f497b5e
UTF-8 閱곕젚筌잌낡溜붾젶I}閱곕젚筌잌낡溜붾젶I{^ 1110100110010110101100011110101010110011100101011110110010100000100110101110011110101101100011001110110010011110100011001110101110000010101000011110111110100111100010111110101110110110101111101110110010100000101101100100100101111101111010011001011010110001111010101011001110010101111011001010000010011010111001111010110110001100111011001001111010001100111010111000001010100001111011111010011110001011111010111011011010111110111011001010000010110110010010010111101101011110 e996b1eab395eca09ae7ad8cec9e8ceb82a1efa78bebb6beeca0b6497de996b1eab395eca09ae7ad8cec9e8ceb82a1efa78bebb6beeca0b6497b5e
UHC 閱곕젚筌잌낡溜붾젶I}閱곕젚筌잌낡溜붾젶I{^ 1110011011110011101100001110101110100000100101101110111110100111100111111110010110110011101100001110101011111110100101001110101110100000101010100100100101111101111001101111001110110000111010111010000010010110111011111010011110011111111001011011001110110000111010101111111010010100111010111010000010101010010010010111101101011110 e6f3b0eba096efa79fe5b3b0eafe94eba0aa497de6f3b0eba096efa79fe5b3b0eafe94eba0aa497b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)