What bit string is a character (or a short string of characters) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 誤??踰??管悠??蟻?????愉ユ????B 100011001110101100111111001111111110011011111010001111110011111110001010110001111001011101001001001111110011111110001011011000010011111100111111001111110011111100111111100101101111100110000011100001100011111100111111001111110011111101000010 8ceb3f3fe6fa3f3f8ac797493f3f8b613f3f3f3f3f96f983863f3f3f3f42
EUC-JP 誤??踰??管悠??蟻??孼??愉ユ?洧??B 10111000111011010011111100111111111011001111110000111111001111111011010011001001110011011010101000111111001111111011010111000010001111110011111110001111101110101100001100111111001111111100110011111011101001011110011000111111100011111100011110110100001111110011111101000010 b8ed3f3fecfc3f3fb4c9cdaa3f3fb5c23f3f8fbac33f3fccfba5e63f8fc7b43f3f42
UTF-8 誤곸룆踰앮끽管悠끾쨫蟻볦컺孼꾨챷愉ユ갭洧좊퓷B 11101000101010101010010011101010101100111011100011101011101000111000011011101000101110001011000011101100100101011010111011101011100000011011110111100111101011101010000111100110100000101010000011101011100000011011111011101100101010001010101111101000100111111011101111101011101100111010011011101100101110111011101011100101101011011011110011101010101111101010100011101100101100011011011111100110100001001000100111100011100000111010011011101010101100001010110111100110101101001010011111101100101000101000101011101101100100111011011101000010 e8aaa4eab3b8eba386e8b8b0ec95aeeb81bde7aea1e682a0eb81beeca8abe89fbbebb3a6ecbbbae5adbceabea8ecb1b7e68489e383a6eab0ade6b4a7eca28aed93b742
UHC 誤곸룆踰앮끽管悠끾쨫蟻볦컺孼꾨챷愉ユ갭洧좊퓷B 111010001010011010000001111011001000111110000101111010111011001010011101111001101011001110100011110011101011011111101010111011011000010111100110101001001000010111101011111111001001001111101100101100001001101111100101111011011000010011101011101010101000010011101010111100001010101111100110101100001011100011101010111110111010000011101011101111111001110101000010 e8a681ec8f85ebb29de6b3a3ceb7eaed85e6a485ebfc93ecb09be5ed84ebaa84eaf0abe6b0b8eafba0ebbf9d42

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
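
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The function name encode_in_charsets and the mapping from the table's charset labels to Python codec names are assumptions (in particular, SJIS-WIN is mapped to cp932 and UHC to cp949). Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the runs of 3f in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_in_charsets(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


for label, bits, hex_string in encode_in_charsets("B"):
    print(label, bits, hex_string)
```

For the input "B", every charset listed here yields the single byte 01000010 (hexadecimal 42), which matches the last byte of each row in the table.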