To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN 訝?????耶??v訝?????耶??vB 11100110011000100011111100111111001111110011111100111111100101101110101100111111001111110111011011100110011000100011111100111111001111110011111100111111100101101110101100111111001111110111011001000010 e6623f3f3f3f3f96eb3f3f76e6623f3f3f3f3f96eb3f3f7642
EUC-JP 訝?????耶??v訝?????耶??vB 11101011110000110011111100111111001111110011111100111111110011001110110100111111001111110111011011101011110000110011111100111111001111110011111100111111110011001110110100111111001111110111011001000010 ebc33f3f3f3f3fcced3f3f76ebc33f3f3f3f3fcced3f3f7642
UTF-8 訝밧끀掠욄㉬耶섉뜆v訝밧끀掠욄㉬耶섉뜆vB 111010001010100010011101111010111011000010100111111010111000000110000000111011111010010110110101111011001001101010000100111000111000100110101100111010001000000010110110111011001000010010001001111010111001110010000110011101101110100010101000100111011110101110110000101001111110101110000001100000001110111110100101101101011110110010011010100001001110001110001001101011001110100010000000101101101110110010000100100010011110101110011100100001100111011001000010 e8a89debb0a7eb8180efa5b5ec9a84e389ace880b6ec8489eb9c8676e8a89debb0a7eb8180efa5b5ec9a84e389ace880b6ec8489eb9c867642
UHC 訝밧끀掠욄㉬耶섉뜆v訝밧끀掠욄㉬耶섉뜆vB 111001001011100010111001111001011000010110110110111001011011000110011110111001101010100010111101111001011010110110011000111001101000110110001001011101101110010010111000101110011110010110000101101101101110010110110001100111101110011010101000101111011110010110101101100110001110011010001101100010010111011001000010 e4b8b9e585b6e5b19ee6a8bde5ad98e68d8976e4b8b9e585b6e5b19ee6a8bde5ad98e68d897642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)