To what bit string is a character (or a sequence of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN ?ワ?爰??節??[?ワ?爰??節??[^ 001111111000001110001111001111111110000010100111001111110011111110010000110111110011111100111111010110110011111110000011100011110011111111100000101001110011111100111111100100001101111100111111001111110101101101011110 3f838f3fe0a73f3f90df3f3f5b3f838f3fe0a73f3f90df3f3f5b5e
EUC-JP ?ワ?爰??節??[?ワ?爰??節??[^ 001111111010010111101111001111111110000010101001001111110011111111000000111000010011111100111111010110110011111110100101111011110011111111100000101010010011111100111111110000001110000100111111001111110101101101011110 3fa5ef3fe0a93f3fc0e13f3f5b3fa5ef3fe0a93f3fc0e13f3f5b5e
UTF-8 曆ワ퐣爰볠략節뚭틩[曆ワ퐣爰볠략節뚭틩[^ 111011111010011010001011111000111000001110101111111011011001000010100011111001111000100010110000111010111011001110100000111010111001111010110101111001111010111110000000111010111001101010101101111011011000101110101001010110111110111110100110100010111110001110000011101011111110110110010000101000111110011110001000101100001110101110110011101000001110101110011110101101011110011110101111100000001110101110011010101011011110110110001011101010010101101101011110 efa68be383afed90a3e788b0ebb3a0eb9eb5e7af80eb9aaded8ba95befa68be383afed90a3e788b0ebb3a0eb9eb5e7af80eb9aaded8ba95b5e
UHC 曆ワ퐣爰볠략節뚭틩[曆ワ퐣爰볠략節뚭틩[^ 111001101011011110101011111011111011110110001100111010101011101010010011111001101011011110101011111011111011110110001100111010101011101010010011010110111110011010110111101010111110111110111101100011001110101010111010100100111110011010110111101010111110111110111101100011001110101010111010100100110101101101011110 e6b7abefbd8ceaba93e6b7abefbd8ceaba935be6b7abefbd8ceaba93e6b7abefbd8ceaba935b5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
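
The same kind of conversion can be reproduced offline. The following is a minimal Python sketch, not the code behind this page. It assumes that the page's charset labels correspond to the Python codec names `iso-8859-1`, `cp932`, `euc_jp`, `utf-8`, and `cp949`, and that characters a charset cannot represent are replaced with `?` (byte `3f`). The names `CODECS`, `to_bit_strings`, and `convert` are hypothetical helpers introduced for illustration.

```python
# Assumed mapping from the charset labels on this page to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" (0x3f) for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte, zero-padded
    return binary, data.hex()


def convert(text):
    """Return {charset label: (binary, hexadecimal)} for every charset in CODECS."""
    return {label: to_bit_strings(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("ワ").items():
        print(label, binary, hexadecimal)
```

For example, `to_bit_strings("ワ", "utf-8")` yields the hexadecimal string `e383af`, while the same character in ISO-8859-1 falls back to `3f` because that charset has no code for it.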