To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 怏??認??楯??[怏??認??楯??[^ 100111001000100100111111001111111001010001000110001111110011111110001111011111000011111100111111010110111001110010001001001111110011111110010100010001100011111100111111100011110111110000111111001111110101101101011110 9c893f3f94463f3f8f7c3f3f5b9c893f3f94463f3f8f7c3f3f5b5e
EUC-JP 怏??認??楯??[怏??認??楯??[^ 110101111110100100111111001111111100011110100111001111110011111110111101110111010011111100111111010110111101011111101001001111110011111111000111101001110011111100111111101111011101110100111111001111110101101101011110 d7e93f3fc7a73f3fbddd3f3f5bd7e93f3fc7a73f3fbddd3f3f5b5e
UTF-8 怏억퐱認㏆㎖楯딇뀬[怏억퐱認㏆㎖楯딇뀬[^ 111001101000000010001111111011001001011010110101111011011001000010110001111010001010101010001101111000111000111110000110111000111000111010010110111001101010010110101111111010111001010010000111111010111000000010101100010110111110011010000000100011111110110010010110101101011110110110010000101100011110100010101010100011011110001110001111100001101110001110001110100101101110011010100101101011111110101110010100100001111110101110000000101011000101101101011110 e6808fec96b5ed90b1e8aa8de38f86e38e96e6a5afeb9487eb80ac5be6808fec96b5ed90b1e8aa8de38f86e38e96e6a5afeb9487eb80ac5b5e
UHC 怏억퐱認㏆㎖楯딇뀬[怏억퐱認㏆㎖楯딇뀬[^ 111001001110100010111110111011111011110110011010111011001110001110100111111011111010011110100010111000101110010010001010111011011000010110100010010110111110010011101000101111101110111110111101100110101110110011100011101001111110111110100111101000101110001011100100100010101110110110000101101000100101101101011110 e4e8beefbd9aece3a7efa7a2e2e48aed85a25be4e8beefbd9aece3a7efa7a2e2e48aed85a25b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)