To what bit string is a character (or a sequence of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 誤??韋??循?????油わⅧ音???碎?ケ 10001100111010110011111100111111111010001110100000111111001111111000111101111010001111110011111100111111001111110011111110010110111110111000001011101101100001110101101110001001101110010011111100111111001111111110000111101010001111111000001101010000 8ceb3f3fe8e83f3f8f7a3f3f3f3f3f96fb82ed875b89b93f3f3fe1ea3f8350
EUC-JP 誤??韋??循?????油わ?音???碎?ケ 101110001110110100111111001111111111000011101010001111110011111110111101110110110011111100111111001111110011111100111111110011001111110110100100111011110011111110110010101110110011111100111111001111111110001011101100001111111010010110110001 b8ed3f3ff0ea3f3fbddb3f3f3f3f3fccfda4ef3fb2bb3f3f3fe2ec3fa5b1
UTF-8 誤곸룆韋귝쨫循녿겱亮쎄퍔油わⅧ音거딀궇碎쇱ケ 111010001010101010100100111010101011001110111000111010111010001110000110111010011001111110001011111010101011011110011101111011001010100010101011111001011011111010101010111010111000010110111111111010101011001010110001111011111010010110110111111011001000111010000100111011011000110110010100111001101011001010111001111000111000001010001111111000101000010110100111111010011001111110110011111010101011000110110000111010111001010010000000111010101011011010000111111001111010001010001110111011001000011110110001111000111000001010110001 e8aaa4eab3b8eba386e99f8beab79deca8abe5beaaeb85bfeab2b1efa5b7ec8e84ed8d94e6b2b9e3828fe285a7e99fb3eab1b0eb9480eab687e7a28eec87b1e382b1
UHC 誤곸룆韋귝쨫循녿겱亮쎄퍔油わⅧ音거딀궇碎쇱ケ 1110100010100110100000011110110010001111100001011110101011011111100000101110011010100100100001011110001011100000100001101110101110000001101111011110010110111001101111011110101010111011100010111110101011111010101010101110111110100101101101111110101111100101101100001100010110001010111001101000001010100000111000011110111110111100111011001010101110110001 e8a681ec8f85eadf82e6a485e2e086eb81bde5b9bdeabb8beafaaaefa5b7ebe5b0c58ae682a0e1efbcecabb1

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
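
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The function name `encode_in_charsets` and the mapping from the table's charset labels to Python codec names are assumptions made for illustration (SJIS-WIN is mapped to `cp932` as noted above, and UHC to `cp949`). Python's codecs may differ from this page's converter for a few vendor-specific characters. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to match the 3f bytes in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # SJIS-Win = CP932, per the note above
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # Unified Hangul Code (Windows code page 949)
}


def encode_in_charsets(text):
    """Return {charset label: (binary string, hex string)} for text."""
    result = {}
    for label, codec in CODECS.items():
        # errors="replace" emits "?" (0x3f) for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        result[label] = (bits, raw.hex())
    return result


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_in_charsets("ケ").items():
        print(label, bits, hex_string)
```

For the single character ケ, this sketch yields e382b1 in UTF-8, 8350 in SJIS-WIN, a5b1 in EUC-JP, and 3f in ISO-8859-1, which agrees with the final bytes of the corresponding rows in the table.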