Which bit string does a character (or string) encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?????ぜ乙??閻??源??柔レ?哀?? 00111111001111110011111100111111001111111000001010111010100010011011001100111111001111111110100010000101001111110011111110001100101110010011111100111111100011110101111110000011100011000011111110001000101000110011111100111111 3f3f3f3f3f82ba89b33f3fe8853f3f8cb93f3f8f5f838c3f88a33f3f
EUC-JP ?????ぜ乙??閻??源??柔レ?哀?? 00111111001111110011111100111111001111111010010010111100101100101011010100111111001111111110111111100101001111110011111110111000101110110011111100111111101111011100000010100101111011000011111110110000101001010011111100111111 3f3f3f3f3fa4bcb2b53f3fefe53f3fb8bb3f3fbdc0a5ec3fb0a53f3f
UTF-8 嶺뚭낫流쒑ぜ乙쇄뵥閻롫똾源당춯柔レ탧哀잙췀 111011111010011010101011111010111001101010101101111010111000001010101011111011111010011110001010111011001001001010010001111000111000000110011100111001001011100110011001111011001000011110000100111010111011010110100101111010011001011010111011111010111010000110101011111010111001100010111110111001101011101010010000111010111000101110111001111011001011011010101111111001101001111110010100111000111000001110101100111011011000001110100111111001011001001110000000111011001001111010011001111011001011011110000000 efa6abeb9aadeb82abefa78aec9291e3819ce4b999ec8784ebb5a5e996bbeba1abeb98bee6ba90eb8bb9ecb6afe69f94e383aced83a7e59380ec9e99ecb780
UHC 嶺뚭낫流쒑ぜ乙쇄뵥閻롫똾源당춯柔レ탧哀잙췀 111001111010110110001100111010101011001110110100111010101111110010011100111010001010101010111100111010111110000010111100111000101001010010100100111001111010001010001110111010111000110010000100111010101011100110110100111001111010110110001100111010101111010110101011111011001011010110001001111001001110111010011111111010111010110110011100 e7ad8ceab3b4eafc9ce8aabcebe0bce294a4e7a28eeb8c84eab9b4e7ad8ceaf5abecb589e4ee9febad9c

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP). Characters that a charset cannot represent appear as "?" (0x3f) in the table above.
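
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the implementation behind this page: the names `CHARSETS` and `encode_table` are hypothetical, and the mapping from the table's labels to Python codec names (SJIS-WIN to `cp932`, UHC to `cp949`) is an assumption. Python's codecs may differ from this tool in a few edge-case mappings. Unencodable characters are replaced with "?" (0x3f), which matches the 3f bytes seen in the table.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
CHARSETS = [
    ("ISO-8859-1", "iso-8859-1"),
    ("SJIS-WIN", "cp932"),   # assumption: SJIS-Win = CP932, as the note states
    ("EUC-JP", "euc_jp"),
    ("UTF-8", "utf-8"),
    ("UHC", "cp949"),        # assumption: UHC = Windows code page 949
]


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS:
        # errors="replace" substitutes "?" for characters the charset lacks.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexstr in encode_table("ぜ乙"):
        print(label, bits, hexstr)
```

For the input "ぜ" this sketch yields 82ba under SJIS-WIN, a4bc under EUC-JP, and e3819c under UTF-8, the same byte sequences that appear for that character in the table.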