To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????管悠??猿??鶯??泣??醫?? 00111111001111110011111100111111001111110011111110001010110001111001011101001001001111110011111110001001100011100011111100111111111010011111001000111111001111111000101110000011001111110011111111100111110011100011111100111111 3f3f3f3f3f3f8ac797493f3f898e3f3fe9f23f3f8b833f3fe7ce3f3f
EUC-JP ???佾??管悠??猿??鶯??泣??醫?? 001111110011111100111111100011111011000011111011001111110011111110110100110010011100110110101010001111110011111110110001111011100011111100111111111100101111010000111111001111111011010111100011001111110011111111101110110100000011111100111111 3f3f3f8fb0fb3f3fb4c9cdaa3f3fb1ee3f3ff2f43f3fb5e33f3feed03f3f
UTF-8 麗몃쓷佾쒏끽管悠끿뙼猿곷젒鶯볤쑴泣길룚醫귣룆 111011111010011010001000111010111010101010000011111011001001001110110111111001001011110110111110111011001001001010001111111010111000000110111101111001111010111010100001111001101000001010100000111010111000000110111111111010111001100110111100111001111000110010111111111010101011001110110111111011001010000010010010111010011011011010101111111010111011001110100100111011001001000110110100111001101011001110100011111010101011100010111000111010111010001110011010111010011000011010101011111010101011011110100011111010111010001110000110 efa688ebaa83ec93b7e4bdbeec928feb81bde7aea1e682a0eb81bfeb99bce78cbfeab3b7eca092e9b6afebb3a4ec91b4e6b3a3eab8b8eba39ae986abeab7a3eba386
UHC 麗몃쓷佾쒏끽管悠끿뙼猿곷젒鶯볤쑴泣길룚醫귣룆 1110011010110000101110001110101110011101100101001110110011101011100111001110011010110011101000111100111010110111111010101110110110000101111001111000110010111111111010101011101110000001111010111010000010010001111001011010001110010011111010101011111010101001111010111110100010110001111001101000111110010110111011001010001010000010111010111000111110000101 e6b0b8eb9d94eceb9ce6b3a3ceb7eaed85e78cbfeabb81eba091e5a393eabea9ebe8b1e68f96eca282eb8f85

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)