What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??誼③??l?癲??誼?????獄 1110000110011111001111110011111110001011011000101000011101000010001111110011111110000010100011000011111111100001100111110011111100111111100010110110001000111111001111110011111100111111001111111000110110010110 e19f3f3f8b6287423f3f828c3fe19f3f3f8b623f3f3f3f3f8d96
EUC-JP 癲??誼??洹l?癲??誼??洧??獄 1110001010100001001111110011111110110101110000110011111100111111100011111100011110111010101000111110110000111111111000101010000100111111001111111011010111000011001111110011111110001111110001111011010000111111001111111011100111110110 e2a13f3fb5c33f3f8fc7baa3ec3fe2a13f3fb5c33f3f8fc7b43f3fb9f6
UTF-8 癲뗢뫀誼③펶洹l졁癲⒲굞誼⑼쭓洧뱀퐡獄 111001111001100110110010111010111001011110100010111010111010101110000000111010001010101010111100111000101001000110100010111011011000111010110110111001101011010010111001111011111011110110001100111011001010000110000001111001111001100110110010111000101001001010110010111010101011010110011110111010001010101010111100111000101001000110111100111011001010110110010011111001101011010010100111111010111011000110000000111011011001000010100001111001111000110110000100 e799b2eb97a2ebab80e8aabce291a2ed8eb6e6b4b9efbd8ceca181e799b2e292b2eab59ee8aabce291bcecad93e6b4a7ebb180ed90a1e78d84
UHC 癲뗢뫀誼③펶洹l졁癲⒲굞誼⑼쭓洧뱀퐡獄 1110111110100110100010111110001010010001101001001110101111111110101010001110100110111100100001111110101010110111101000111110110010100000101100101110111110100110101010011110001110000010100001101110101111111110101010011110111110100111100010111110101011111011101110011110110010111101100010101110100010101011 efa68be291a4ebfea8e9bc87eab7a3eca0b2efa6a9e38286ebfea9efa78beafbb9ecbd8ae8ab

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
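
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumed to be the closest Python equivalents, and `CODECS`, `encode_row`, and `convert` are hypothetical names chosen for illustration. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which is an assumption about how the unencodable characters in the table were handled.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset and collect one result row per charset."""
    return {name: encode_row(text, codec) for name, codec in CODECS.items()}


if __name__ == "__main__":
    for name, (bits, hexstr) in convert("あ").items():
        print(name, bits, hexstr)
```

Running the script prints one line per charset, in the same column order as the table above without the Character column: charset name, bit string, hexadecimal string.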