What bit string is a character (or a short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???一??儀?┃嚥△?吟??矣??堊 001111110011111100111111100010001110101000111111001111111000101101010110001111111000010010101011100110101000101110000001101000100011111110001011111000010011111100111111111000011110000100111111001111111001101010111111 3f3f3f88ea3f3f8b563f84ab9a8b81a23f8be13f3fe1e13f3f9abf
EUC-JP ???一??儀?┃嚥△?吟??矣??堊 001111110011111100111111101100001110110000111111001111111011010110110111001111111010100010101101110100111110101110100010101001000011111110110110111000110011111100111111111000101110001100111111001111111101010011000001 3f3f3fb0ec3f3fb5b73fa8add3eba2a43fb6e33f3fe2e33f3fd4c1
UTF-8 捻뀁궠一룩퉪儀먮┃嚥△뫗吟묊솾矣뚯돇堊 111011111010011010100100111010111000000010000001111010101011011010100000111001001011100010000000111010111010001110101001111011011000100110101010111001011000010010000000111010111010100010101110111000101001010010000011111001011001101010100101111000101001011010110011111010111010101110010111111001011001000010011111111010111010110010001010111011001000011010111110111001111001111110100011111010111001101010101111111010111000111110000111111001011010000010001010 efa6a4eb8081eab6a0e4b880eba3a9ed89aae58480eba8aee29483e59aa5e296b3ebab97e5909febac8aec86bee79fa3eb9aafeb8f87e5a08a
UHC 捻뀁궠一룩퉪儀먮┃嚥△뫗吟묊솾矣뚯돇堊 1110011011110111101100101110110010000010101100111110110011101001101101111110100010111001100000101110101111110000100100001110101110100110101011011110011010111111101000011110001010010001101110011110101111100001100100011110011110011001101100101110101111111000100011001110110010001001100110001110010010111110 e6f7b2ec82b3ece9b7e8b982ebf090eba6ade6bfa1e291b9ebe191e799b2ebf88cec8998e4be

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
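
The conversion the table shows can be approximated offline with a minimal Python sketch. This is not the page's own implementation. The codec names are assumptions that map the table's labels onto Python's standard codecs (SJIS-WIN as `cp932`, UHC as `cp949`), and the helper names `encode_row` and `encode_table` are hypothetical. Characters that a charset cannot represent are replaced with `?` (hex 3f), which is consistent with the runs of 3f in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' (0x3f) instead of raising an error.
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("一").items():
        print(label, bits, hex_string)
```

For the single character 一, this sketch yields e4b880 in UTF-8, 88ea in CP932, and b0ec in EUC-JP, the same byte sequences that appear for that character in the table rows.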