To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN ???瓮i?娃??[???瓮i?娃??[^ 001111110011111100111111111000010100010010000010100010010011111110001000101000010011111100111111010110110011111100111111001111111110000101000100100000101000100100111111100010001010000100111111001111110101101101011110 3f3f3fe14482893f88a13f3f5b3f3f3fe14482893f88a13f3f5b5e
EUC-JP ???瓮i?娃??[???瓮i?娃??[^ 001111110011111100111111111000011010010110100011111010010011111110110000101000110011111100111111010110110011111100111111001111111110000110100101101000111110100100111111101100001010001100111111001111110101101101011110 3f3f3fe1a5a3e93fb0a33f3f5b3f3f3fe1a5a3e93fb0a33f3f5b5e
UTF-8 僚묌걶瓮i뒔娃쒑랜[僚묌걶瓮i뒔娃쒑랜[^ 111011111010011010111011111010111010110010001100111010101011000110110110111001111001001110101110111011111011110110001001111010111001001010010100111001011010100010000011111011001001001010010001111010111001111010011100010110111110111110100110101110111110101110101100100011001110101010110001101101101110011110010011101011101110111110111101100010011110101110010010100101001110010110101000100000111110110010010010100100011110101110011110100111000101101101011110 efa6bbebac8ceab1b6e793aeefbd89eb9294e5a883ec9291eb9e9c5befa6bbebac8ceab1b6e793aeefbd89eb9294e5a883ec9291eb9e9c5b5e
UHC 僚묌걶瓮i뒔娃쒑랜[僚묌걶瓮i뒔娃쒑랜[^ 111010001110100010010001111010011000000110011100111010001011011110100011111010011000101010010001111010001101111110011100111010001011011110100011010110111110100011101000100100011110100110000001100111001110100010110111101000111110100110001010100100011110100011011111100111001110100010110111101000110101101101011110 e8e891e9819ce8b7a3e98a91e8df9ce8b7a35be8e891e9819ce8b7a3e98a91e8df9ce8b7a35b5e

SJIS-Win, EUC-JP: Legacy Japanese character encodings, used mainly on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). Characters that a charset cannot represent appear as "?" (hexadecimal 3f) in the table.
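
The rows of the table can be approximated offline with a few lines of Python. This is a minimal sketch, not the code behind this page: the mapping from the table's charset labels to Python codec names (`CODECS`) and the helper names `encode_row` and `encode_table` are assumptions made for illustration. In particular, `cp932` stands in for SJIS-WIN and `cp949` for UHC, and Python's conversion tables may differ from this tool's in edge cases. Unencodable characters are replaced with "?" (3f), which matches what the table shows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 as the Python equivalent
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: Windows code page 949
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # Characters the charset cannot represent become "?" (0x3f).
    raw = text.encode(codec, errors="replace")
    # Decode again to see what survived the conversion.
    shown = raw.decode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


def encode_table(text):
    """Build one row per charset label, in the order listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (shown, bits, hex_string) in encode_table("あ[^").items():
        print(label, shown, bits, hex_string)
```

ASCII characters such as "[" and "^" encode to the same single bytes (5b, 5e) in all five charsets, which is why every row of the table ends in 5b5e.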