To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??踰??幽??筌??誼??蟻??沃?? 11100001100111110011111100111111111001101111101000111111001111111001011101001000001111110011111111100010101000110011111100111111100010110110001000111111001111111000101101100001001111110011111110010111100000000011111100111111 e19f3f3fe6fa3f3f97483f3fe2a33f3f8b623f3f8b613f3f97803f3f
EUC-JP 癲??踰??幽??筌??誼??蟻??沃?? 11100010101000010011111100111111111011001111110000111111001111111100110110101001001111110011111111100100101001010011111100111111101101011100001100111111001111111011010111000010001111110011111111001101111000000011111100111111 e2a13f3fecfc3f3fcda93f3fe4a53f3fb5c33f3fb5c23f3fcde03f3f
UTF-8 癲됱빖踰숅솻幽뚯춷筌뚯슦誼녻뀑蟻숆강沃쇱걖 111001111001100110110010111010111001000010110001111010111011100110010110111010001011100010110000111011001000100010000101111011001000011010111011111001011011100110111101111010111001101010101111111011001011011010110111111001111010110110001100111010111001101010101111111011001000101010100110111010001010101010111100111010111000010110111011111010111000000010010001111010001001111110111011111011001000100010000110111010101011000010010101111001101011001010000011111011001000011110110001111010101011000110010110 e799b2eb90b1ebb996e8b8b0ec8885ec86bbe5b9bdeb9aafecb6b7e7ad8ceb9aafec8aa6e8aabceb85bbeb8091e89fbbec8886eab095e6b283ec87b1eab196
UHC 癲됱빖踰숅솻幽뚯춷筌뚯슦誼녻뀑蟻숆강沃쇱걖 111011111010011010001001111011001001010110111000111010111011001010011001111010011001100110110000111010101110101110001100111011001010110110010011111011111010011110001100111011001001101010110000111010111111111010000110111010001000010110001011111010111111110010011001111010101011000010101101111010001010101010111100111011001000000110000001 efa689ec95b8ebb299e999b0eaeb8cecad93efa78cec9ab0ebfe86e8858bebfc99eab0ade8aabcec8181

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)