To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN ??????哀??v??????哀??vB 0011111100111111001111110011111100111111001111111000100010100011001111110011111101110110001111110011111100111111001111110011111100111111100010001010001100111111001111110111011001000010 3f3f3f3f3f3f88a33f3f763f3f3f3f3f3f88a33f3f7642
EUC-JP 濚?????哀??v濚?????哀??vB 100011111100100110100001001111110011111100111111001111110011111110110000101001010011111100111111011101101000111111001001101000010011111100111111001111110011111100111111101100001010010100111111001111110111011001000010 8fc9a13f3f3f3f3fb0a53f3f768fc9a13f3f3f3f3fb0a53f3f7642
UTF-8 濚믥킋溜쀫젛哀잙젘v濚믥킋溜쀫젛哀잙젘vB 111001101011111110011010111010111010111110100101111011011000001010001011111011111010011110001011111011001000000010101011111011001010000010011011111001011001001110000000111011001001111010011001111011001010000010011000011101101110011010111111100110101110101110101111101001011110110110000010100010111110111110100111100010111110110010000000101010111110110010100000100110111110010110010011100000001110110010011110100110011110110010100000100110000111011001000010 e6bf9aebafa5ed828befa78bec80abeca09be59380ec9e99eca09876e6bf9aebafa5ed828befa78bec80abeca09be59380ec9e99eca0987642
UHC 濚믥킋溜쀫젛哀잙젘v濚믥킋溜쀫젛哀잙젘vB 111001111011100110010010111001111011010010010111111010101111111010010111111010111010000010010111111001001110111010011111111010111010000010010100011101101110011110111001100100101110011110110100100101111110101011111110100101111110101110100000100101111110010011101110100111111110101110100000100101000111011001000010 e7b992e7b497eafe97eba097e4ee9feba09476e7b992e7b497eafe97eba097e4ee9feba0947642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)