To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????H 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101001000 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f48
SJIS-WIN 瓮??澳??節??搖??瓮??澳??絶??饒??H 111000010100010000111111001111111110000001010011001111110011111110010000110111110011111100111111100111011000101000111111001111111110000101000100001111110011111111100000010100110011111100111111100100001110001000111111001111111110100101100000001111110011111101001000 e1443f3fe0533f3f90df3f3f9d8a3f3fe1443f3fe0533f3f90e23f3fe9603f3f48
EUC-JP 瓮??澳??節??搖??瓮??澳??絶??饒??H 111000011010010100111111001111111101111110110100001111110011111111000000111000010011111100111111110110011110101000111111001111111110000110100101001111110011111111011111101101000011111100111111110000001110010000111111001111111111000111000001001111110011111101001000 e1a53f3fdfb43f3fc0e13f3fd9ea3f3fe1a53f3fdfb43f3fc0e43f3ff1c13f3f48
UTF-8 瓮륅슭澳묇퉿節뀐슥搖억쉠瓮륅슭澳묈넇絶욑슁饒뽳슴H 11100111100100111010111011101011101001011000010111101100100010101010110111100110101111101011001111101011101011001000011111101101100010011011111111100111101011111000000011101011100000001001000011101100100010101010010111100110100100001001011011101100100101101011010111101100100010011010000011100111100100111010111011101011101001011000010111101100100010101010110111100110101111101011001111101011101011001000100011101011100001001000011111100111101101011011011011101100100110101001000111101100100010101000000111101001101001011001001011101011101111011011001111101100100010101011010001001000 e793aeeba585ec8aade6beb3ebac87ed89bfe7af80eb8090ec8aa5e69096ec96b5ec89a0e793aeeba585ec8aade6beb3ebac88eb8487e7b5b6ec9a91ec8a81e9a592ebbdb3ec8ab448
UHC 瓮륅슭澳묇퉿節뀐슥搖억쉠瓮륅슭澳묈넇絶욑슁饒뽳슴H 11101000101101111000111111101111101111011011111011100111111111101001000111100100101110011001011111101111101111011011001011101111101111011011101111101000111101001011111011101111101111011010101011101000101101111000111111101111101111011011111011100111111111101001000111100101100001101001011111101111101111101001111011101111101111011011001111101001101011101001011011101111101111011011111101001000 e8b78fefbdbee7fe91e4b997efbdb2efbdbbe8f4beefbdaae8b78fefbdbee7fe91e58697efbe9eefbdb3e9ae96efbdbf48

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)