To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 罌??援℡?音??}罌??援℡?音??{^ 1110001110100000001111110011111110001001100001111000011110000100001111111000100110111001001111110011111101111101111000111010000000111111001111111000100110000111100001111000010000111111100010011011100100111111001111110111101101011110 e3a03f3f898787843f89b93f3f7de3a03f3f898787843f89b93f3f7b5e
EUC-JP 罌??援??音??}罌??援??音??{^ 111001101010001000111111001111111011000111100111001111110011111110110010101110110011111100111111011111011110011010100010001111110011111110110001111001110011111100111111101100101011101100111111001111110111101101011110 e6a23f3fb1e73f3fb2bb3f3f7de6a23f3fb1e73f3fb2bb3f3f7b5e
UTF-8 罌삘넂援℡섧音욍뀅}罌삘넂援℡섧音욍뀅{^ 111001111011110110001100111011001000001010011000111010111000010010000010111001101000111110110100111000101000010010100001111011001000010010100111111010011001111110110011111011001001101010001101111010111000000010000101011111011110011110111101100011001110110010000010100110001110101110000100100000101110011010001111101101001110001010000100101000011110110010000100101001111110100110011111101100111110110010011010100011011110101110000000100001010111101101011110 e7bd8cec8298eb8482e68fb4e284a1ec84a7e99fb3ec9a8deb80857de7bd8cec8298eb8482e68fb4e284a1ec84a7e99fb3ec9a8deb80857b5e
UHC 罌삘넂援℡섧音욍뀅}罌삘넂援℡섧音욍뀅{^ 111001011010001010111011111000101000011010010010111010101011010110100010111001011011110010110101111010111110010110111111111000111000010110000001011111011110010110100010101110111110001010000110100100101110101010110101101000101110010110111100101101011110101111100101101111111110001110000101100000010111101101011110 e5a2bbe28692eab5a2e5bcb5ebe5bfe385817de5a2bbe28692eab5a2e5bcb5ebe5bfe385817b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)